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(54) INFORMATION STORAGE MEDIUM AND INFORMATION RECORDING/REPRODUCING 
SYSTEM 



(57) Tliere are provided an information storage 
medium capable of real-time recording/playback of dig- 
ital moving picture information, and a digital information 
recording/playback system using this medium. In a 
medium that records/plays back data including video 



data and control information, the control information 
(DA21 in FIG. 4; RTR_VMG in FIG. 30) includes infor- 
mation (VOBU entry in FIG. 31) for accessing a specific 
portion (VOBU) of the video data. 
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Description 

Technical Field 

5 [0001] The present invention relates to an information storage medium represented by a large-capacity optical disc 
and a digital information recording/playback system using the medium. 

[0002] In particular, the present invention relates to a DVD (digital versatile disc) recording/playback system that 
considers real-time recording of a moving picture. 

[0003] The present invention also relates to a recording/playback system which can guarantee continuous playback 
10 (or continuous recording) upon continuously playing back (or continuously recording) information using playback 
devices (disc drives) having various access performances. 

[0004] Furthermore, the present invention relates to a recording/playback system which can prevent any playback 
timing errors of video information and audio information recorded on the medium. 

15 Background Art 

(Description of Prior Art) 

[0005] In recent years, systems for playing back the contents of optical discs that have recorded video (moving pic- 
20 ture) data, audio data, and the like have been developed, and have prevailed for the purpose of playing back movie soft- 
ware, karaoke, and so on like LDs (laser discs), video CDs (video compact discs), and the like. 

[0006] Among such systems, a DVD (Digital Versatile Disc) standard that uses MPEG2 (Moving Picture Experts 

Group) international standard, and adopts an audio compression scheme such as AC-3 (digital audio compression) or 
the like has been proposed. The DVD standard includes read-only DVD video (or DVD-ROM), write-once DVD-R, and 

25 erasable/rewritable DVD-RW (or DVD-RAM). 

[0007] The DVD video (DVD-ROM) standard supports MPEG2 as a movie compression scheme, and ACS audio 
and MPEG audio in addition to linear PCM as an audio recording scheme, in accordance with MPEG2 system layer. 
Furthermore, this DVD video standard is configured by appending sub-picture data for superimposed dialogs obtained 
by runlength-compressing bitmap data, and control data (navigation data) for playback control such as fastfonwarding, 

30 rewinding, data search, and the like. 

[0008] Also, this standard supports ISO9660 and UDF Bridge format to allow a computer to read data. Hence, a 
personal computer environment can handle video information of DVD video. 

(Problem) 

35 

[0009] However, a personal computer system and DVD recording/playback system use different appropriate infor- 
mation processing methods, and it is difficult for the personal computer to record/play back movie information for a long 
period of time continuously (without being interrupted). 

[0010] More specifically, in the personal computer environment, when file data is to be changed, a process for re- 

40 recording the entire changed file data on a free area of an information storage medium (HDD or the like) is done. At this 
time, the re-recording position on the information storage medium is determined irrespective of the file data recording 
position before change. The file data recording position before change is released as a small free area after the change. 
If such change of file data is frequently repeated, small free areas are scattered in a vermicular pattern at physically 
separated positions on the medium. As a result, upon recording new file data, that data is recorded on a plurality of ver- 

45 miculated free areas while being fragmented. This state is called fragmentation. 

[0011] In the information process of the personal computer, information (file data) used is readily scattered (frag- 
mented) on the disc. Even when a file to be read out has been fragmented, required information can read out from a 
disc by sequentially playing back such fragments recorded randomly. This fragmentation slightly prolongs the required 
read-out time of a file, but the user does not feel disrupted if a high-speed HDD is used. 

50 [0012] However, when recorded information (MPEG-compressed moving picture data) has been fragmented in a 
DVD recording/playback system, if such fragments recorded randomly are to be played back in turn, moving picture 
playback may often be interrupted. Especially, since an optical disc drive requires a longer seek time of an optical head 
than a high-speed disc drive such as an HDD or the like, the playback video is readily interrupted during seeking frag- 
mented information in the DVD recording/playback system that records/plays back an MPEG moving picture video 

55 on/from an optical disc (DVD-RAM disc or the like), resulting in poor practicality of the current system. 

[0013] When both personal computer data and DVD moving picture data are recorded, fragmentation is more likely 
to occur. Therefore, the DVD recording/playback system that includes the personal computer environment has no fea- 
sibility unless a very high-speed optical disc drive is developed, and a large-size buffer can be mounted at practical cost. 
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[0014] On the medium that records video information (ceils), upon repeating editing and partial deletion of recorded 

information, individual pieces of video information lie scattered or straggle on the medium. When such scattered or 
straggling video information group is to be continuously played back according to a specific order, frequent accesses 
are required. During these accesses, since a recorded information group (a series of cells) cannot be played back from 
5 the medium, playback is interrupted. 

[0015] That is, when a specific playback device (disc drive) plays back a scattered or straggling video information 
group while making frequent accesses, if the access frequency has exceeded a specific number of times, it becomes 
impossible (for that drive) to continuously output recorded information, thus disturbing seamless playback (without any 
interrupt). 

10 [0016] Furthermore, the frequency shift of the reference clocks of a normal digital audio recorder is approximately 
0.1%. When sound source information digitally recorded by a digital video tape (DAT) recorder is overdubbed on video 
information already recorded on a DVD-RAM disc by digital copy, the reference clocks between the video information 
and audio information may have an error of around 0.1%. Such reference clock error becomes so large that it cannot 
be ignored upon repeating the digital copy (or nonlinear edit using a personal computer or the like), and appears as 

15 interrupted playback tones or a phase shift between playback channels. 

[0017] In some cases, audio information corresponding to a specific video pack is stored in an audio pack at a loca- 
tion largely separated from that video pack. When a specific cell is re-recorded at another location on an information 
storage medium, synchronization between video and audio packs fails if packs under specific cells are simply moved. 
When playback is made after recording, playback tones are interrupted at that portion. 

20 

(Objects) 

[0018] It is the first object of the present invention to provide an information storage medium capable of real-time 
recording/real-time playback of digital moving picture information, and a digital information recording/playback appara- 
25 tus using this medium. 

[0019] It is the second object of the present invention to provide an information storage medium capable of seam- 
less, continuous playback free from any interrupt by managing the access frequency to a scattered or straggling 
recorded information group in correspondence with the access performance of the playback device (disc drive) used, 
and a digital information recording/playback apparatus using this medium. 

30 [0020] It is the third object of the present invention to provide an information storage medium which has special syn- 
chronization information so that video information and audio information can be synchronously played back (or inter- 
channel phase synchronization of multi-channel audio information can be taken) even when the reference clocks of the 
audio information have any error, and a digital information recording/playback apparatus using this medium. 
[0021] It is the fourth object of the present invention to provide a digital information recording/playback system 

35 which can prevent playback information from being lost (sound interrupt or the like) upon playing back specific informa- 
tion, the recording position of which has been changed, when the recording position of the specific information (specific 
cell) in, e.g., video information, has been changed by, e.g., an edit process of information recorded on an information 
storage medium. 

40 Disclosure of Invention 

[0022] In order to achieve the first object, in an information storage medium according to the present invention, 
which records and plays back data including video data and control information, the control information (DA21 in FIG. 
4; RTR_VMG in FIG. 30) includes information (VOBU entry) that accesses a specific portion (VOBU) of the video data 
45 (DA22). 

[0023] In order to achieve the first object, in an information recording system according to the present invention, 
which uses an information storage medium that can record data including video data and control information, informa- 
tion (VOBU entry) for accessing a specific portion (VOBU) of the video data (DA22) is described in the control informa- 
tion (DA21 in FIG. 4; RTR_VMG in FIG. 30) recorded on the information storage medium. 
50 [0024] In order to achieve the second object, in an information storage medium according to the present invention, 
which records a plurality of pieces of video information at discrete positions, the plurality of pieces of video information 
are recorded to decrease an access frequency to the video information to be not more than a predetermined number 
of times upon sequentially playing back the plurality of pieces of video information. 

[0025] In order to achieve the second object, in an information playback system according to the present invention, 
55 which plays back recorded information from an information storage medium which records a plurality of pieces of video 
information at discrete positions, when an access frequency to the video information exceeds a predetermined number 
of times upon sequentially playing back the plurality of pieces of video information, recording locations of the plurality 
of pieces of video information are changed to decrease the access frequency to be not more than the predetermined 
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number of times. 

[0026] In order to achieve the third object, in an information storage medium according to the present invention, 
which records audio/video data including video information, audio information, and control information, the control infor- 
mation describes audio synchronization information (VOBU information/audio synchronization information) for talking 

5 synchronization between the video information and the audio information. 

[0027] In order to achieve the third object, an information recording/playback system according to the present 
invention, which records/plays back audio/video data which includes video information, audio information, and control 
information on/from a predetermined information storage medium, describes audio synchronization information (VOBU 
information/audio synchronization information) in the control information, and synchronizes the video information and 

10 audio information on the basis of the audio synchronization information upon playback. 

[0028] In order to achieve the fourth object, in an information recording/playback system according to the present 
invention, which records and plays back audio/video data including video information, audio information, and control 
information on and from a predetermined information storage medium, audio synchronization information (VOBU infor- 
mation/audio synchronization information) is described in the control information, and when specific information in the 

15 video information is re-recorded at a different position on the information storage medium, audio information that syn- 
chronizes the video information is re-recorded at a different position on the information storage medium in accordance 
with the audio synchronization information. 

Brief Description of Drawings 

20 

[0029] 

FIG. 1 is a view for explaining the structure of a recordable/reproducible optical disc (DVD-RAIVI/DVD-RW or the 
like), and the correspondence between data recorded on the disc and recording tracks. 
25 FIG. 2 is a view for explaining the structure of a sector included in the data area shown in FIG. 1 . 

FIG. 3 is a view for explaining an EGG unit of information included in the data area shown in FIG. 1 . 

FIG. 4 is a view for explaining an example of the hierarchical structure of information recorded on the optical disc 

shown in FIG. 1 . 

FIG. 5 is a view that exemplifies the correspondence between the cell configuration of a video object and program 
30 chain PGC in the information hierarchical structure shown in FIG. 4, and the recorded contents of a lead-in area. 

FIG. 6 is a view that exemplifies the hierarchical structure of information included in a video object shown in FIG. 5. 

FIG. 7 is a view for explaining the contents of a dummy pack shown in FIG. 6. 

FIG. 8 is a view that exemplifies the internal structure of cell time information CTI shown in FIG. 4. 

FIG. 9 is a view that exemplifies the contents of VOBU information shown in FIG. 8. 
35 FIG. 10 is a view that exemplifies the hierarchical structure of information included in control information DA21 

shown in FIG. 4. 

FIG. 11 is a view for explaining a case wherein the boundary positions of video object units VOBU in a cell shown 
in FIG. 6 deviate from those of blocks (a32-kbyte EGG block is formed by 1 6 sectors (2 kbytes per sector: minimum 
unit)) that form data in the cell. 

40 FIG. 12 is a view for explaining a case wherein the boundary positions of video object units VOBU in a cell shown 

in FIG. 6 match those of blocks (2 kbytes per sector: minimum unit) that form data in the cell. 
FIG. 13 is a view for explaining a case upon playing back cell data recorded on the disc shown in FIG. 1 . 
FIG. 14 is a table for explaining an example of the relationship between cells that form playback data shown in FIG. 
13 and program chain information. 

45 FIG. 1 5 is a schematic view of a playback system for explaining continuity of a playback signal. 

FIG. 1 6 is a graph for explaining an example of the relationship between the access operations and the like and the 
temporary storage amount in a buffer memory upon continuously playing back a video signal. 
FIG. 17 is a graph for explaining another example (highest access frequency) of the relationship between the 
access operations and the like and the temporary storage amount in a buffer memory upon continuously playing 

50 back a video signal. 

FIG. 1 8 is a graph for explaining still another example (the playback time balances with the access time) of the rela- 
tionship between the access operations and the like and the temporary storage amount in a buffer memory upon 
continuously playing back a video signal. 

FIG. 19 is a graph for explaining the relationship between the seek distance and seek time of an optical head. 
55 FIG. 20 is a view for explaining the method of obtaining the average seek distance of the optical head. 

FIG. 21 is a schematic view of a recording system for explaining continuity of a recording signal. 
FIG. 22 is a view that exemplifies cells which form a part of recorded AV data (video signal information) and a 
sequence of video object units VOBU in each cell. 
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FIG. 23 is a case wherein cell #2 is edited and data falls short in the middle of cell #2 (at the position of VOBU 1 08e) 
(VOBU 108e is re-encoded) in the sequence shown in FIG. 22. 

FIG. 24 is a view for explaining changes of the cell configuration, VOBU sequence, and the position of a free area 
shown in FIG. 22 upon completion of editing in FIG. 23. 
5 FIG. 25 is a graph for explaining an example (highest access frequency) of the relationship between the access 

operations and the like and the temporary storage amount in a buffer memory upon continuously playing back a 
video signal. 

FIG. 26 is a graph for explaining still another example (the playback time balances with the access time) of the rela- 
tionship between the access operations and the like and the temporary storage amount in a buffer memory upon 
10 continuously playing back a video signal. 

FIG. 27 is a block diagram for explaining the arrangement of a DVD video recorder which can cope with a synchro- 
nization error between video and audio upon re-arranging (e.g., editing) video information in a video object. 
FIG. 28 is a block diagram showing the internal arrangement of an encoder and decoder in the arrangement shown 
in FIG. 27. 

15 FIG. 29 is a flow chart for explaining the synchronization process between video and audio in the DVD video 
recorder shown in FIG. 27. 

FIG. 30 is a view for explaining another example of the hierarchical structure of information recorded on the optical 
disc shown in FIG. 1. 

FIG. 31 is a view that exemplifies the contents (especially VOBU entry VOBU_ENT#) of time map information 
20 TMAPI shown in FIG. 30, and also exemplifies the correspondence between the contents and AV data control infor- 

mation DA210 shown in FIG. 4. 

FIG. 32 is a view that exemplifies the contents of time map general information TMAP_GI shown in FIG. 31. 

FIG. 33 is a view that exemplifies the contents of time entry TM_ENT# shown in FIG. 31 . 

FIG. 34 is a view for explaining the recorded contents of data area DA in FIG. 30, and a time entry point (access 
25 point) upon playing back a specific portion (e.g., VOBU#3) of the recorded contents. 

FIG. 35 is a view for explaining an example of the directory structure of information (data files) recorded on the opti- 
cal disc shown in FIG. 1 to have the structure shown in FIG. 30. 

FIG. 36 is a schematic view for explaining a case wherein the cell playback order of the initially recorded contents 
(original PGC) has been changed by the user later using a user-defined PGC. 
30 FIG. 37 is a view for explaining problems that will occur in audio data when video recording is interrupted before a 

GOP of MPEG-encoded video data comes to an end upon recording corresponding audio data together with 
MPEG-encoded video data. 

Best Mode for Carrying Out the Invention 

35 

[0030] A digital information recording/playback system according to an embodiment of the present invention will be 
described hereinafter with reference to the accompanying drawings. 

[0031] As a typical embodiment of a digital information recording/playback system according to the present inven- 
tion, an apparatus which records/plays back moving picture data encoded based on MPEG2, e.g., a DVD digital video 
40 recorder, is known. (A practical example of this DVD digital video recorder will be explained later.) 

[0032] FIG. 1 is a view for explaining the structure of recordable optical disc (DVD-RAM/DVD-RW disc or the like) 
10 used in the DVD digital video recorder. 

[0033] As shown in FIG. 1 , this optical disc 1 0 has a structure obtained by adhering a pair of transparent substrates 
14 each having recording layer 17 using adhesive layer 20. Each substrate 14 can be formed of a 0.6-mm thick poly- 

45 carbonate film, and adhesive layer 20 can consist of a very thin (e.g., 40 |Lim to 70 |Lim thick) ultraviolet setting resin. 
When these pair of 0.6-mm thick substrates 1 4 are adhered to each other so that their recording layers 1 7 contact each 
other on the surfaces of adhesive layer 20, a 1 .2-mm thick large-size optical disc 1 0 is obtained. 
[0034] Note that each recording layer 17 can have a ROM/RAM double-layered structure. In this case, ROM 
layer/light reflection layer (emboss layer) 17A is formed on the side closer to read-out face 19, and RAM layer/phase 

50 change recording layer 1 7B is formed on the side farther from read-out face 1 9. 

[0035] Optical disc 10 has center hole 22, and clamp areas 24 used to clamp optical disc 10 upon its rotation are 
formed around center hole 22 on the two surfaces of the disc. Center hole 22 receives the spindle of a disc motor when 
disc 1 0 is loaded into a disc drive device (not shown). Optical disc 1 0 is clamped at its clamp areas 24 by a disc clamper 
(not shown) during disc rotation. 

55 [0036] Optical disc 1 0 has information areas 25 that can record information such as video data, audio data, and the 
like around clamp areas 24. 

[0037] In each information area 25, lead-out area 26 is assured on the outer periphery side. Also, lead-in area 27 
is assured on the inner periphery side of area 25 that contacts clamp area 24. The area between lead-out and lead-in 
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areas 26 and 27 is defined as data recording area 28. 

[0038] On recording layer (light reflection layer) 17 of information area 25, a recording track is continuously formed 
in, e.g., a spiral pattern. The continuous track is divided into a plurality of physical sectors, which have serial numbers. 
Various data are recorded on optical disc 10 using those sectors as recording units. 
5 [0039] Data recording area 28 serves as an actual data recording area, and records video data (main picture data) 
such as a movie or the like, sub-picture data such as superimposed dialogs, menus, and the like, and audio data such 
as words, effect sounds, and the like as recording/playback information in the form of similar pit trains (physical shapes 
or phase states that bring about optical change in laser reflected light). 

[0040] When optical disc 10 is a double-sided recording RAIVI disc in which each surface has one recording layer, 
10 each recording layer 17 can be formed by three layers, i.e., by sandwiching a phase-change recording material layer 
(e.g., Ge2Sb2Te5) between two zinc sulfide • silicon oxide (ZnS • Si02) mixture layers. 

[0041] When optical disc 10 is a single-sided recording RAM disc in which each surface has one recording layer, 
recording layer 17 on the side of read-out face 19 can be formed by three layers including the aforementioned phase- 
change recording material layer. In this case, layer 1 7 on the side opposite to read-out face 1 9 need not be an informa- 
15 tion recording layer but may merely be a dummy layer. 

[0042] When optical disc 1 0 is a single-sided read type double-layered RAM/ROM disc, two recording layers 1 7 can 
comprise a single phase-change recording layer (on the side farther from read-out face 19; read/write), and a single 
semi-transparent metal reflection layer (on the side closer to read-out face 19; read-only). 

[0043] When optical disc 10 is a write-once DVD-R, a polycarbonate substrate is used, gold can be used as a 
20 reflection layer (not shown), and an ultraviolet setting resin can be used as a protection layer (not shown). In this case, 
an organic dye is used in recording layer 17. As the organic dyes, cyanine, squarilium, chroconic, and triphenylmen- 
thane dyes, xanthene and quinone dyes (naphthoquinone, anthraquinone, and the like), metal complex dyes (phthalo- 

cyanine, porphyrin, dithiol complex, and the like), and so forth can be used. 

[0044] Data can be written on such DVD-R disc using a semiconductor laser having a wavelength of 650 nm and 
25 an output of around 6 to 12 mW. 

[0045] When optical disc 1 0 is a single-sided read type double-layered ROM disc, two recording layers 1 7 can com- 
prise a single metal reflection layer (on the side farther from read-out face 19), and a single semi-transparent metal 

reflection layer (on the side closer to read-out face 19). 

[0046] On read-only DVD-ROM disc 1 0, pit trains are formed in advance on substrate 14 by a stamper, a reflection 
30 layer of a metal or the like is formed on the surface of substrate 1 4 on which the pit trains are formed, and the reflection 
layer is used as recording layer 1 7. On such DVD-ROM disc 1 0, grooves as recording tracks are not particularly formed, 
and the pit trains formed on the surface of substrate 14 serve as tracks. 

[0047] In various types of optical discs 10 described above, read-only ROM information is recorded on recording 
layer 1 7 as an embossed pattern signal. By contrast, no such embossed pattern signal is formed on substrate 14 having 
35 read/write (or write-once) recording layer 17, and a continuous groove is formed instead. A phase-change recording 
layer is formed on such groove. In case of a read/write DVD- RAM disc, the phase-change recording layer in land por- 
tions is also used for information recording in addition to the groove. 

[0048] When optical disc 10 is of single-sided read type (independently of one or two recording layers), substrate 
14 on the rear side viewed from read-out face 19 need not always be transparent to the read/write laser beam used. In 
40 this case, a label may be printed on the entire surface of substrate 14 on the rear side. 

[0049] A DVD digital video recorder (to be described later) can be designed to attain write many/read many 
(read/write) for a DVD-RAM disc (or DVD-RW disc), write once/read many for a DVD-R disc, and read many for a DVD- 
ROM disc. 

[0050] When disc 10 is a DVD-RAM (or DVD-RW), disc 10 itself is stored in cartridge 1 1 to protect its delicate disc 
45 surface. When DVD-RAM disc 1 0 in cartridge 1 1 is inserted into the disc drive of a DVD video recorder (to be described 
later), disc 1 0 is pulled out from cartridge 1 1 , is clamped by the turntable of a spindle motor (not shown), and is rotated 
to face an optical head (not shown). 

[0051] On the other hand, when disc 10 is a DVD-R or DVD-ROM, disc 10 itself is not stored in cartridge 11, and 

bare disc 1 0 is directly set on the disc tray of a disc drive. 
50 [0052] FIG. 1 also shows the correspondence between data recording area 28 of optical disc (DVD-RAM or the like) 
1 0 and recording tracks of data recorded there. 

[0053] Recording layer 1 7 of information area 25 is formed with a continuous data recording track in a spiral pattern. 
The continuous track is segmented into a plurality of logical sectors (minimum recording units) each having a given stor- 
age size, and data are recorded with reference to these logical sectors. The recording size per logical sector is deter- 
55 mined to be 2,048 bytes (or 2 kbytes) which are equal to one pack data length. 

[0054] Data recording area 28 is an actual data recording area, which similarly records management data, main 
picture (video) data, sub-picture data, and/or audio data. 

[0055] Note that data recording area 28 of disc 10 can be segmented into a plurality of ring-shaped (annular) 
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recording areas (a plurality of recording zones), although not shown. The disc rotational velocity varies in units of 
recording zones. However, within each zone, a constant linear or angular velocity can be set. In this case, an auxiliary 
recording area, i.e., spare area (free space) can be provided for each zone. These free spaces in units of zones may 
collectively form a reserve area for that disc 1 0. 

5 [0056] The recording signal structure of information recorded on an information storage medium (DVD-RAM disc 
10 or the like) and the method of generating the recording signal structure will be explained below. Note that the con- 
tents themselves of information recorded on the medium are referred to as "information", and a structure or expression 
obtained by scrambling or modulating information with identical contents, i.e., a sequence of "1 " and "0" states after sig- 
nal format conversion, is expressed as a "signal" to appropriately distinguish them from each other. 

10 [0057] FIG. 2 is a view for explaining the structure of a sector included in the data area shown in FIG. 1 . One sector 
shown in FIG. 2 corresponds to one of sector numbers of 2,048-byte sectors shown in FIG. 1 . Each sector alternately 
includes synchronization codes and modulated signals (video data and the like) to have a header embossed on disc 10. 
[0058] FIG. 3 is a view for explaining a recording unit (a unit of error correction code ECC) of information included 
in the data area shown in FIG. 1 . 

15 [0059] In a FAT (file allocation table) prevalently used in file systems of information storage media (hard disc HDD, 
magnetooptical disc MO, and the like) for personal computers, information is recorded on an information storage 
medium to have 256 or 512 bytes as a minimum unit. 

[0060] By contrast, in information storage media such as a CD-ROM, DVD-ROM, DVD-RAM, and the like, UDF 
(universal disc format) is used as a file system. In this case, information is recorded on an information storage medium 

20 to have 2,048 bytes as a minimum unit. This minimum unit is called a sector. That is, each 2,048-byte information is 
recorded on an information storage medium (optical disc 10) using UDF in units of sectors 501, as shown in FIG. 3. 
[0061] Since a CD-ROM and DVD-RAM are handled as bare discs without using any cartridge, the surface of an 
information storage medium is readily scratched or becomes attached with dust on the user side. For this reason, a spe- 
cific sector (e.g., sector 501c in FIG. 3) cannot often be played back (or recorded) due to the influences of dust or 

25 scratches on the information storage medium surface. 

[0062] DVD adopts error correction (ECC using a product code) in consideration of such situation. More specifically, 
1 6 sectors (1 6 sectors from sector 501 a to sector 501 p in FIG. 3) form one ECC (error correction code) block 502, which 
has a strong error correction function. As a result, even when an error in ECC block 502 (e.g., sector 501 c is impossible 
to play back) has occurred, such error can be corrected, and all pieces of information in ECC block 502 can be correctly 

30 played back. 

[0063] FIG. 4 is a view for explaining an example of the hierarchical structure of information recorded on optical disc 
(especially DVD-RAM or DVD-RW) disc 10 shown in FIG. 1. 

[0064] Lead-in area 27 includes an embossed data zone whose light reflection surface has an embossed pattern, 
a mirror zone whose surface is flat (mirror surface), and a rewritable data zone capable of information rewrites. 
35 [0065] Data recording area (volume space) 28 is comprised of user rewritable volume/file management information 
70, and data area DA. 

[0066] Data area DA between lead-in and lead-out areas 27 and 26 can record both computer data and AV data. 
The recording order and recording information sizes of computer data and AV data are arbitrary, and an area where 
computer data is recorded is named a computer data area (DAI , DA3), and an area where AV data is recorded is 
40 named an AV data area (DA2). 

[0067] Volume/file management information 70 can record information that pertains to the entire volume, the 
number of files computer data (data of a personal computer) and the number of files associated with AV data included 
in volume space 28, and information that pertains to recording layer information and the like. 
[0068] Especially, the recording layer Information can contain: 

45 

* the number of building layers (for example, two layers in case of a single ROM/RAM double-layered disc, also two 
layers in case of a single double-layered disc having only a ROM layer, and n layers in case of n single-sided, sin- 
gle-layered discs irrespective of ROM or RAM layers); 

logical sector number range tables (indicating the size of each layer) assigned in units of layers; 
50 * characteristics (a DVD-RAM disc, a RAM portion of a ROM/RAM double-layered disc, a DVD-R, CD-ROM, CD-R, 
and the like) in units of layers; 

assigned logical sector number range tables (including rewritable area size information in units of layers) in units of 
zones of a RAM area of each layer; and 

unique ID information (to find out disc exchange in a multiple disc pack) in units of layers. 

55 

[0069] With the recording layer information including the aforementioned contents, even a multiple disc pack and a 
ROM/RAM double-layered disc can be handled as a single, large volume space by setting series logical sector num- 
bers. 
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[0070] Data area DA records computer data, video data, audio data, and tlie like. Volume/fiie management infor- 
mation 70 records information that pertains to files of audio/video data recorded on data area DA or tine entire volume. 
[0071] Lead-out area 26 is also capable of information rewrites. 
[0072] The embossed data zone of lead-in area 27 records, for example, in advance: 

01 )information which pertains to the entire information storage medium: the disc type (a DVD-ROM, DVD-RAM (or 
DVD-RW), DVD-R, or the like); disc size (12 cm, 8 cm, or the like); recording density; physical sector numbers indi- 
cating the recording start/end positions, and the like; 

02) information which pertains to the recording/playback/erasure characteristics: the recording power and record- 
ing pulse width; erase power; playback power; linear velocity upon recording and erasure, and the like; and 
03>information which pertains to the manufacture of each information storage medium: the manufacturing number 
and the like. 

The rewritable zone of each of lead-in area 27 and lead-out area 26, for example, includes: 
04 )a field for recording a unique disc name of each information storage medium; 

05 >a test recording field (for confirming recording/erasure conditions); and 

06 )a field for recording management information that pertains to defective fields in data area DA. 

On fields 04) to 06 J a DVD recording apparatus (a dedicated DVD video recorder, a personal computer 
installed with a DVD video processing board and processing software, or the like) can record information. 

[0073] Data area DA can record audio/video data DA2 and computer data DAI and DAS together. 

[0074] Note that the recording order, recording information size, and the like of computer data and audio/video data 

are arbitrary. Data area DA can record computer data or audio/video data alone. 

[0075] Audio/video data area DA2 includes control information DA21 , video object DA22, picture object DA23, and 

audio object DA24. 

[0076] At the first position of audio/video data area DA2, anchor pointer AP having information that indicates the 
recording location of control information DA21 is present. When the information recording/playback system uses infor- 
mation in this audio/video data area DA2, the recording location of control information DA21 is checked based on 
anchor pointer AP, and control information DA21 is read by accessing that location. 
[0077] Video object DA22 includes information of the contents of recorded video data. 

[0078] Picture object DA23 can include still picture information such as still pictures, slide pictures, thumbnail pic- 
tures that represent the contents of video object DA22 used upon search/edit, and the like. 
[0079] Audio object DA24 includes information of the contents of recorded audio data. 

[0080] Note that recording information of the playback target (contents) of audio/video data is included in video 
object set VOBS shown in FIG. 5 (to be described later). 

[0081] Control information DA21 includes AV data control information DA21 0, playback control information DA21 1 , 

recording control information DA212, edit control information DA213, and thumbnail picture control information DA214. 
[0082] AV data control information DA210 includes information which manages the data structure in video object 
DA22 and manages information that pertains to the recording locations on information storage medium (optical disc or 
the like) 10, and information CIRWNs indicating the number of times of rewrite of control information. 
[0083] Playback control information DA21 1 includes control information required upon playback, and has a function 
of designating a sequence of program chains PGC. More specifically, playback control information DA21 1 includes: 
information that pertains to a playback sequence which combines PGGs; information indicating a "pseudo recording 
location" while considering information storage medium 10 as, e.g., a single tape (digital video cassette DVC or video 
tape VTR) in association with that information (a sequence for continuously playing back all recorded cells); information 
that pertains to multi-screen simultaneous playback having different video information contents; search information 
(information that records corresponding cell IDs and a table of the start times of cells in units of search categories, and 
allows the user to select a category to directly access the video information of interest); and the like. 
[0084] With this playback control information DA21 1 , the file name of an AV file, the path of a directory name, the 
ID of PGC, and the cell ID can be designated. 

[0085] Recording control information DA212 includes control information (programmable timer recording informa- 
tion or the like) required upon recording (video recording and/or audio recording). 

[0086] Edit control information DA213 includes control information required upon edit. For example, edit control 
information DA213 can include special edit information (EDL information such as corresponding time setting informa- 
tion, special edit contents, and the like) in units of PGCs, and file conversion information (information that converts a 
specific portion in an AV file, and designates the file storage location after conversion, or the like). 
[0087] Thumbnail picture control information DA214 includes management information that pertains to thumbnail 
pictures used to search for a scene that the user wants to see in video data or those to be edited, and thumbnail picture 
data. 
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[0088] Thumbnail picture control information DA214 can include a picture address table, thumbnail picture data, 
and the like. Thumbnail picture control information DA214 can also include, as lower-layer information of the picture 
address table and thumbnail picture data, menu index information, index picture information, slide & still picture infor- 
mation, information picture information, defective area information, wallpaper picture information, and the like (not 
5 shown). 

[0089] AV data control information DA210 includes allocation map table AMT, program chain control information 

PGCCI, and cell time control information CTCI. 

[0090] Allocation map table AUT includes information that pertains to address setups along the actual data alloca- 
tion, identification of recorded/unrecorded areas, and the like on the information storage medium (optical disc 10 or the 
10 like). In the example shown in FIG. 4, allocation map table AMT includes user area allocation descriptor UAD, spare 
area allocation descriptor SAD, and address conversion table ACT. 

[0091] Program chain control information PGCCI includes information that pertains to a video playback program 
(sequence). 

[0092] Cell time control information CTCI includes information that pertains to the data structure of a basic unit 
15 (cell) of video information. This cell time control information CTCI includes cell time control general information CTCGI, 

cell time search information CTSI, and m pieces of cell time search information CTI#1 to CTI#m. 

[0093] Cell time control general information CTCGI includes information that pertains to individual cells. Cell time 

search information CTSI is map information indicating a description position (AV address) of corresponding cell time 

information when a specific cell ID is designated. 
20 [0094] Each cell time search information (CTI#m) is comprised of cell time general information CTGI#m and cell 

VOBU table CVT#m. Details of cell time search information (CTI#m) will be explained later with reference to FIG. 8. 

[0095] An outline of FIG. 4 has been explained. Supplementary explanations of the individual information will be 

summarized below. 

[0096] (1 1 )Volume/file management information 70 includes: 

25 

information that pertains to entire volume space 28; 

the number of files of computer data (DA1 , DAS) and audio/video data (AV data DA2) included in volume space 28; 
the recording layer information of the information storage medium (DVD-RAM disc, DVD-ROM disc, or DVD- 
ROM/RAM multi-layered disc); and the like. 

30 

[0097] The recording layer information records: 

the number of building layers (example: the number of layers of a single RAM/ROM double-layered disc is counted 
as two, that of a single ROM double-layered disc is also counted as two, and that of n single-sided discs is counted 
35 as n); 

logical sector number range tables (corresponding to the size of each layer) assigned in units of layers; 
characteristics (example: a DVD-RAM disc, a RAM portion of a ROM/RAM double-layered disc, CD-ROM, CD-R, 
and the like) in units of layers; 

assigned logical sector number range tables (including rewritable area size information in units of layers) in units of 
40 zones of a RAM area of each layer; 

unique ID information (e.g., to find out disc exchange in a multiple disc pack) in units of layers; and the like. With 
this information, series logical sector numbers can be set even for a multiple disc pack or RAM/ROM double-lay- 
ered disc to handle it as a single, large volume space. 

45 [0098] (12)Playback control information DA211 records: 

information that pertains to a playback sequence which combines PGCs; 

"information indicating a pseudo recording location" while considering information storage medium 10 as, e.g., a 
single tape (digital video cassette DVC or video tape VTR) in association with the playback sequence that com- 
50 bines PGCs (a sequence for continuously playing back all recorded cells); 

information that pertains to multi-screen simultaneous playback having different video information contents; 
search information (information that records corresponding cell IDs and a table of the start times of cells in units of 
search categories, and allows the user to select a category to directly access the video information of interest); and 
the like. 

55 

[0099] (l3)Recording control information DA212 records: 
programmable timer recording information; and the like. 
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[0100] (14) Edit control information DA213 records: 

special edit information (tliat describes corresponding time setting information and special edit contents as an edit 
library (EDL)) in units of PGCs; 
5 file conversion information (information that converts a specific portion in an AV file, and designates the file storage 

location after conversion, or the like); and the like. 

[0101] FIG. 5 exemplifies the correspondence between the cell configuration of a video object and program chain 
PGC in the information hierarchical structure shown in FIG. 4, and also exemplifies the recorded contents of the lead- 
10 in area. 

[0102] In the information hierarchical structure shown in FIG. 5, video object DA22 is comprised of video object set 
VOBS. This VOBS has contents corresponding to one or more program chains PGC#1 to PGC#k which respectively 
designate the cell playback order in different methods. 

[0103] A video object set (VOBS) is defined as a set of one or more video objects (VOB). Video objects VOB in 
15 video object set VOBS are used for the same purpose. 

[0104] For example, a VOBS for a menu normally consists of one VOB, which stores a plurality of menu screen dis- 
play data. By contrast, a VOBS for a title set normally consists of a plurality of VOBs. 

[0105] Taking a concert video title of a certain rock band as an example, VOBs that form a video object set 
(VTSTT_VOBS) for a title set correspond to picture data of the performance of that band. In this case, by designating a 
20 given VOB, for example, the third tune in the concert of that band can be played back. 

[0106] A VOB that forms video object set VTSM_VOBS for a menu stores menu data of all the tunes performed in 
the concert of the band, and a specific tune, e.g., an encore, can be played back according to the menu display. 
[0107] Note that one VOB can form one VOBS in a normal video program. In this case, a single video stream 
comes to an end in one VOB. 

25 [0108] On the other hand, in case of a collection of animations having a plurality of stories or an omnibus movie, a 
plurality of video streams (a plurality of video chains PGC) can be set in a single VOBS in correspondence with the 
respective stories. In this case, the individual video streams are stored in corresponding VOBs. An audio stream and 

sub-picture stream pertaining to each video stream end in the corresponding VOB. 

[0109] VOBs are assigned identification numbers (VOB_IDN#i; i = 0 to i), and that VOB can be specified by the 
30 identification number. A VOB consists of one or a plurality of cells. A normal video stream consists of one or a plurality 
of cells, but a video stream for a menu often consists of a single cell. Cells are assigned identification numbers 
(C_IDN#j) like VOBs. 

[0110] FIG. 5 also exemplifies the logical structure of information recorded on lead-in area 27 of optical disc 10 
shown in FIG. 1 . 

35 [0111] When disc 10 is set in a DVD video recorder (not shown) (or a DV video player; not shown), information in 
lead-in area 27 is read first. Lead-in area 27 records a predetermined reference code and control data in ascending 
order of sector number. 

[0112] The reference code in lead-in area 27 includes a predetermined pattern (a repetitive pattern of a specific 
symbol "172") and consists of two error correction code blocks (ECC blocks). Each ECC block has 16 sectors. These 
40 two ECC blocks (32 sectors) are generated by appending scramble data. Upon playing back the reference code 
appended with the scramble data, filter operation or the like on the playback side is done to play back a specific data 
symbol, (e.g., 172) to assure precision in subsequent data reads. 

[0113] Control data in lead-in area 27 is made up of 1 92 ECC blocks. In this control data field, the contents for 1 6 
sectors in the respective blocks are repetitively recorded 192 times. 

45 [0114] FIG. 6 exemplifies the hierarchical structure of information included in video object DA22 shown in FIG. 5. 
[01 1 5] As shown in FIG. 6, each cell (for example, cell #m) consists of one or more video object units (VOBU). Each 
video object unit is constituted as a set (pack sequence) of video packs, sub-picture packs, and audio packs. 
[0116] Each of these packs has a size of 2,048 bytes, and serves as a minimum unit for data transfer. The minimum 
unit for logical processing is a cell, and logical processing is done in units of cells. 

50 [0117] The playback time of video object unit VOBU corresponds to that of video data made up of one or more pic- 
ture groups (groups of pictures; to be abbreviated as GOPs), and is set to fall within the range from 0.4 sec to 1.2 sec. 
One GOP is screen data which normally has a playback time of about 0.5 sec in the MPEG format, and is compressed 
to play back approximately 15 frame pictures during this interval. 

[0118] When video object unit VOBU includes video data, a video datastream is formed by arranging GOPs (com- 
55 plying with MPEG) each consisting of video packs, sub-picture packs, audio packs, and the like. Also, video object unit 
VOBU is defined by one or more GOPs. 

[0119] Even playback data consisting of audio data and/or sub-picture data alone is formed using video object unit 
VOBU as one unit. For example, when video object unit VOBU is formed by audio packs alone, audio packs to be played 
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back in the playback time of video object unit VOBU to wliich the audio data belong are stored in that video object unit 
VOBU as in the video object of video data. 

[0120] The packs that form each video object unit VOBU have the same data structure except for a dummy pack. 
Taking an audio pack as an example, as shown in FIG. 6, a pack header is set at the head of the pack, and a packet 
5 header, substream ID, and audio data follow. In such pack configuration, the packet header is written with information 
of presentation time stamp PTS indicating the head time of the first frame in a packet. 

[0121] With a DVD video recorder that can record video title set VTS (or video program) containing video object 
DA22 with the structure shown in FIG. 6 on optical disc 10, the user often wants to edit the recorded contents after this 
VTS is recorded. In order to meet such requirement, dummy packs can be appropriately inserted in each VOBU. Each 
10 dummy pack can be used to record edit data later. 

[0122] Information that pertains to cells #1 to #m shown in FIG. 6 is recorded in cell time control information CTCI 
in FIG. 4 and, as shown in FIG. 4, its contents include: 

cell time information CTI#1 to cell time information CTI#m (information that pertains to each cell); 
15 cell time search information CTSI (map information indicating a description position (AV address) of corresponding 

cell time information when a specific cell ID is designated); and 

cell time control general information CTCGI (information that pertains to the entire cell information). 

[0123] Each cell time information (e.g., CTI#m) includes cell time general information (CTGI#m) and a cell VOBU 
20 table (CVT#m). 

[0124] The data structure in video object DA22 will be explained below. 

[0125] A minimum basic unit of video information is called a cell. Data in video object DA22 is configured as a set 

of one or more cells #1 to #m, as shown in FIG. 6. 

[0126] l\/IPEG2 (or MPEG1) is prevalently used as the video information compression technique in video object 
25 DA22. MPEG segments video information into groups called GOPs in 0.5-sec increments, and compresses video infor- 
mation in units of GOPs. A video information compression unit called video object unit VOBU is formed by one or more 
GOPs. 

[0127] In the present invention, the VOBU size is set to match an integer multiple of ECC block size (32 kbytes) (one 
of important features of the present invention). 
30 [0128] Furthermore, each VOBU is segmented into packs in units of 2,048 bytes, and these packs record raw video 
information (video data), audio information (audio data), sub-picture information (superimposed dialog data, menu data, 
and the like), dummy information and the like. These are recorded in the form of video packs, audio packs, sub-picture 
packs, and dummy packs. 

[0129] Note that the dummy pack is inserted for the purposes of: 

35 

addition of information to be additionally recorded after video recording (for example, memo information indicating 
that postrecording information is inserted into an audio pack and is replaced by a dummy pack is inserted in a sub- 
picture pack as sub-picture information and is replaced by a dummy pack); 

compensating for a size that is short from an integer multiple of 32 kbytes to adjust the VOBU size just to an integer 
40 multiple of ECC block size (32 kbytes); and the like. 

[0130] In each pack, a pack header and packet header (and substream ID) are set before object data (audio data 
in case of, e.g., an audio pack). 

[0131] In the DVD video format, an audio pack and sub-picture pack include the substream ID between the packet 
45 header and object data. 

[0132] In the packet header, a time code for time management is recorded. Taking an audio packet as an example, 
PTS (presentation time stamp) information that records the head time of the first audio frame in that packet is inserted 
in the form shown in FIG. 6. 

[0133] FIG. 7 shows the structure of the contents (for one dummy pack) of the dummy pack shown in FIG. 6. That 
50 is, one dummy pack 89 is comprised of pack header 891 , packet header 892 having a predetermined stream ID, and 
padding data 893 padded with a predetermined code (insignificant data). (Packet header 892 and padding data 893 
form padding packet 890). The contents of padding data 893 of an unused dummy pack do not have any special mean- 
ing. 

[0134] This dummy pack 89 can be used as needed when the recorded contents are edited after predetermined 
55 recording is done on disc 1 0 shown in FIG. 1 . Also, dummy pack 89 can be used to store thumbnail picture data, which 
is used for a user menu. Furthermore, dummy pack 89 can be used for the purpose of matching each VOBU size in AV 
data DA2 with an integer multiple of 32 kbytes (32-kbyte align). 

[0135] For example, a case will be examined below wherein the contents of a video tape that recorded a family trip 
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using a portable video camera are recorded and edited on DVD-RAM (or DVD-RW) disc 10. 

[0136] In this case, only tlie video scenes to be stored in a single disc are selectively recorded on disc 10. Tliese 
video scenes are recorded in a video pack in FIG. 6. Also, audio data simultaneously recorded by the video camera is 
recorded in an audio pack. 

5 [0137] Each VOBU that includes these video pack, audio pack, and the like can have a navigation pack (not 
shown), which is adopted in DVD video, at its beginning, as needed. 

[0138] This navigation pack contains playback or presentation control information PCI and data search information 
DSI. Using this PCI or DSI, the playback procedure of each VOBU can be controlled (for example, discontinuous scenes 
can be automatically connected or a multiangle scene can be recorded). 
10 [0139] Alternatively, each VOBU may have a synchronization navigation pack (SNV_PCK; not shown) which simply 
has synchronization information in units of VOBUs without having contents as complex as those of a navigation pack of 
DVD video. 

[0140] Note that a DVD video RAM assumes a case without using any navigation pack currently. However, a DVD- 
R may use a navigation pack. 

15 [0141] After the contents of the video tape are edited and recorded on disc 1 0, when a voice, effect sound, and the 
like are to be postrecorded in each scene in units of VOBUs or background music BGM is added, such postrecorded 
audio data or BGM can be recorded In dummy pack 89. When a comment for the recorded contents Is to be added, sub- 
pictures such as additional characters, figures, and the like can be recorded In dummy pack 89. Furthermore, when an 
additional video picture Is to be inserted, the inserted video picture can be recorded in dummy pack 89. 

20 [0142] The above-mentioned postrecorded audio data or the like is written in padding data 893 of dummy pack 89 
used as an audio pack. The additional comment is written in padding data 893 of dummy pack 89 used as a sub-picture 
pack. Similarly, the Inserted video picture Is written In padding data 893 of dummy pack 89 used as a video pack. 
[0143] Furthermore, when each VOBU size including the recorded/edited pack sequence does not match an inte- 
ger multiple of ECC block size (32 kbytes), dummy pack 89 which includes as padding data 893 insignificant data that 

25 can match this VOBU size with an integer multiple of 32 kbytes can be inserted into each VOBU. 

[0144] In this manner, by appropriately inserting a dummy pack (padding pack) into each recorded/edited VOBU to 
match each VOBU size with an Integer multiple of ECC block size (the aforementioned 32-kbyte align), all VOBUs can 
be always be rewritten in units of ECC blocks. 

[0145] Alternatively, when 32-kbyte align is done, if a RAM layer of disc 10 has suffered a defect, only the defect 
30 portion can be replaced In units of ECC blocks. Furthermore, when the ECC block unit Is used as the address unit of 
AV data, each VOBU address can be easily converted. 

[0146] That Is, dummy pack 89 is a wildcard pack that can become any of audio, sub-picture, and video packs 
depending on its purpose. 

[0147] FIG. 8 is a view for explaining the internal structure of cell time information CTI shown in FIG. 4. 
35 [0148] As has been explained In the description of FIG. 4, each cell time search Information (CTI#m) Is made up of 

cell time general information CTGI#m and cell VOBU table CVT#m. 

[0149] As shown in the upper half of FIG. 8, the cell time general information includes: 

(1) cell data general information; 
40 (2) a time code table; 

(3) acquired defect information; 

(4) cell video information; 

(5) cell audio Information; and 

(6) cell sub-picture Information. 

45 

[0150] Cell data general information (1 ) contains a cell ID, the total time duration of that cell, the number of cell data 
sets (extents), a cell data set descriptor, a cell time physical size, and the number of constituent VOBUs of that cell. 
[0151] Note that the cell ID Is a unique ID In units of cells. The total time duration indicates the total time required 
for playing back that cell. 

50 [0152] The number of cell data sets (number of extents) indicate the number of cell data set descriptors in that cell. 

[0153] The description contents of the cell data set descriptor (cell data extent descriptor) will be explained below. 

[0154] Assume that a recording information cluster, which pertains to a single cell In the layout order of ECC blocks 

that can be used Is one cell data set (cell data extent). In this case, specific cell #1 is considered as one cell data set 

unless it is divided by another cell #2. 
55 [0155] As an example of a description method, the length (the number of ECC blocks where a cell data set is 

recorded) of the cell data set Is expressed by "2 bytes", the start address (AV address) of the cell data set Is expressed 

by "3 bytes", and they are described at neighboring positions. For example, we have: 
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Cell data set descriptor (the number of ECC blocks, start address) = CED (*,*) 

[0156] A statement that describes all cell data sets that form one cell is a cell data set descriptor. With this descrip- 
tor, the distribution of all AV addresses where cells are recorded can be determined, thus allowing easy access. 

5 [0157] Since the lengths of cell data sets and the start AV addresses of cell data sets are described in pairs, when 
many continuously recorded arias are formed on information storage medium 10, the number of bytes required for 
describing the cell data set descriptor decreases, the data size required for the cell time general information (#m) 
decreases, and the recording size that can be used for video object DA22 can be relatively increased accordingly. 
[0158] Note that corresponding AV addresses viewed along the layout of information storage medium 1 0 are often 

10 arranged in a discontinuous order. However, since allocation map table AMT shown in FIG. 4 is available, the recording 
locations of all data in a given cell on the information storage medium can be specified by the start AV address in the 
cell data set descriptor. 

[0159] In FIG. 8, the cell time physical size indicates a recording location size on the information storage medium 
where a cell including an inherent defect location is recorded. By combining this cell time physical size and the total time 
15 duration, the size of an inherent defective area in a given cell can be detected, and a practical transfer or transmission 
rate can be expected. This cell time physical size can be used to determine a recording location candidate of a cell that 
can guarantee continuous playback. 

[0160] The number of constituent VOBUs indicates the number of VOBUs that constitute that cell. 
[0161] Time code table (2) includes information of the number of pictures in VOBUs #1 to #n which form the cell, 
20 and information of the number of ECC blocks in VOBUs #1 to #n which form the cell (as shown in FIG. 3, since one ECC 
block = 16 sectors, the information of the number of ECC blocks can also be expressed by information of the number 
of sectors). 

[0162] A time code in this table is expressed by a pair of the number of pictures (the number of video frames; 
expressed by 1 byte) in units of VOBUs in the cell of interest, and the number of used ECC blocks (expressed by 1 byte) 
25 in units of VOBUs at the recording location on the medium indicated by the cell data set descriptor. Using this expres- 
sion method, the time code can be recorded in a very small information size (compared to a case wherein time codes 
are appended to each of 30 frames per sec in NTSC). 

[0163] Acquired defect information (3) includes the number of acquired defects in that cell, and information of the 
addresses of the acquired defects. 
30 [0164] The number of acquired defects indicates the number of ECC blocks that have suffered acquired defects In 
that cell. The acquired defect address indicates the location of each acquired defect by an AV address in units of ECC 
blocks. Every time a defect is produced upon cell playback (i.e., ECC error correction fails), the AV address of the defec- 
tive ECC block is registered as the acquired defect address. 

[0165] Cell video information (4) includes information such as the type of video information (NTSC, PAL, or the like) 
35 of that cell, a compression method (MPEG2, MPEG1 , motion JPEG, or the like), a stream ID and substream ID (main 
screen or sub screen; used in multi-screen simultaneous recording/playback), a maximum transmission rate, and the 
like. 

[0166] Cell audio information (5) includes information such as the type of an audio signal (linear PCM, MPEG1, 
MPEG2, Dolby AC-3, or the like), a sampling frequency (48 kHz or 96 kHz), the number of quantization bits (1 6 bits, 20 
40 bits, or 24 bits), and the like. 

[0167] Cell sub-picture information (6) includes the number of sub-picture streams in each cell and information indi- 
cating their recording locations. 

[0168] On the other hand, the cell VOBU table includes VOBU information #1 to VOBU information #n which form 
that cell, as shown in the lower half of FIG. 8. Each VOBU information includes VOBU general information, dummy pack 
45 information, and audio synchronization information. 

[0169] The individual information contents in the cell time information (CTI#m) in FIG. 8 can be summarized as fol- 
lows: 

(11) cell data general information (general information that pertains to each cell and includes the following con- 
so tents); 

(11.1) cell ID (unique identifier for each cell) 

(1 1 .2) total time duration (total required time required for playing back cell contents) 

(1 1 .3) the number of cell data sets (the number of cell data set descriptors in a cell) 
55 (1 1 .4) cell data set descriptor 

(11.5) cell time physical size (which indicates the recording location size on the information storage medium 
where a cell including an inherent defect location is recorded. By combining with the aforementioned "total time 
duration", the size of an inherent defective area in the cell can be determined, and a practical transmission rate 
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can be expected. This information is used to "determine a recording location candidate of a cell that can guar- 
antee continuous playback".) 

(1 1 .6) the number of constituent VOBUs (the number of VOBUs that constitute a cell) 

5 (12) time code table; 

(13) acquired defect information (acquired defect information detected in a cell, which includes the following con- 
tents); 

(13.1) the number of acquired defects (the number of ECC blocks in which acquired defects that have suffered 
10 acquired defects in a cell) 

(13.2) acquired defect address (which indicates the location of acquired defect by an AV address value in units 
of ECC blocks. The address value is registered as needed every time a defect is produced upon playback of a 
cell.) 

15 (14) cell video information (including the following contents); 

(14.1) video signal type (NTSC or PAL) 

(14.2) compression method (MPEG2, MPEG1, or motion JPEG) 

(14.3) stream ID and substream ID information (main screen or sub screen R used in multi-screen simultane- 

20 ous recording/playback) 

(14.4) maximum transmission rate 

(15) cell audio information (including the following contents); 

25 (15.1) signal type (linear PCM, MPEG1, MPEG2, or Dolby AC-3) 

(15.2) sampling frequency 

(15.3) the number of quantization bits 

(16) cell sub-picture information (which indicates the number of streams of sub-picture information in each call and 
30 their recording locations.) 

[0170] The aforementioned "time code table" is expressed by pairs of the numbers (the numbers of frames: 
expressed by 1 byte) #1 to #n of pictures in units of VOBUs in a cell, and the numbers (expressed by 1 byte) of used 
ECC blocks in units of VOBUs at the recording locations on the information storage medium indicated by the "cell data 
35 set descriptor", as indicated in the upper portion in FIG. 8. 

[0171] Using this expression method, the time code can be recorded with a very small information size. 
[0172] An access method using this time code will be explained below. 

1 . A video recording/playback application designates the cell ID to be accessed and its time; 
40 2. the video management layer detects the picture number (frame number) of the corresponding picture (video 

frame) from the cell start position on the basis of the designated time; 

3. the video management layer computes by sequentially summing up the number of pictures (the number of 
frames) in units of VOBUs from the head of the cell shown in FIG. 8 to detect a picture number (frame number) and 
VOBU number from the head to which the picture (frame) designated by the video recording/playback application 

45 corresponds; 

4. the recording locations of all data in the cell on the information storage medium are detected from the cell data 
set descriptor shown in FIG. 8 and allocation map table AMT shown in FIG. 4; 

5. the values of the numbers (#1 to #n) of ECC blocks of VOBUs (#n) in FIG. 8 are summed up to the VOBU number 

(#n) detected in item "3." above, and the AV address at the corresponding VOBU start position is checked; 
50 6. the corresponding VOBU start position is directly accessed on the basis of the result in item "5." above to trace 

until the predetermined picture (frame) obtained in item "3." above is reached; and 

7. at this time, when l-picture recording end position information in the VOBU to be accessed is required, l-picture 
end position information in FIG. 9 is used. 

55 [0173] FIG. 9 is a view for explaining the internal structure of the cell VOBU table (VOBU information) shown in FIG. 
8. 

[0174] Time management information (presentation time stamp PTS) that pertains to audio information is recorded 
in a packet header, as shown in FIG. 6. In order to extract this information (PTS), information of an audio pack must be 
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directly played back. However, since the audio pack is recorded at a location deep inside tine management layer, editing 
of video information in units of cells becomes time-consuming. 

[0175] To combat this problem of "time-consuming editing in units of cells" and the like, synchronization information 
for audio information is provided in AV data control information DA210 in FIG. 4. This synchronization information is 
5 audio synchronization information shown in FIG. 9. 

[0176] Referring to FIG. 9, VOBU general information indicates the end position of l-picture of MPEG-encoded 
video information, and is expressed by a differential address from the start position of a VOBU at the last position of I- 
picture (1 byte). 

[0177] Dummy pack information is expressed by the number of dummy packs (1 byte) indicating the number of 

10 dummy packs (FIG. 7) inserted into respective VOBUs, and the dummy pack distribution (dummy pack numbers X 2 
bytes) including the differential address from the start position of a given VOBU to the dummy pack insertion position 
(2 bytes) and the individual numbers of dummy packs (2 bytes). 

[0178] Audio synchronization information is expressed by an audio stream channel number (1 byte) indicating the 
number of channels of an audio stream, l-picture audio positions #1, #2, ... (1 byte each; the most significant bit desig- 
ns nates the direction of a position including another concurrent audio pack ... "0" = backward, "1" = forward), each of 
which indicates the differential address value of an EGG block that includes an audio pack of the same time as the I- 
picture start time from the head of a VOBU, l-picture start audio sample numbers #1, #2, ... (2 bytes each), each of 
which indicates the sample number of the audio sample position of the same time as the l-picture start time in a given 
EGG block as a coefficient of serial numbers of all audio packs, audio synchronization information flags #1 , #2, ... (1 
20 byte each), each of which indicates the presence/absence of synchronization information between audio and video 
streams, and audio synchronization data (2 bytes) which is appended to each audio synchronization information flag 
only when the audio synchronization information flag indicates the "presence of synchronization information", and indi- 
cates the number of audio samples included in a corresponding VOBU. 

[0179] Each of l-picture start audio positions #1 , #2, ... in FIG. 9 indicates the differential address value of an EGG 
25 block that includes an audio pack of the same time as the l-picture start time from the head of the corresponding VOBU. 
[0180] Furthermore, l-picture start audio positions #1, #2, ... in FIG. 9 indicate the audio sample positions of the 
same time as the l-picture start time as counts of serial numbers of all audio packs. 

[0181] For example, upon dividing AV information in a given cell in video editing, when a VOBU in that cell is divided 
into two VOBUs and each divided information is re-encoded, division free from any interrupt of playback tones and any 
30 phase shift between playback channels can be implemented using the aforementioned information (l-picture start audio 
position #1 and l-picture start audio sample number #1) in FIG. 9. 
[0182] An example of this point will be explained below. 

[0183] The frequency shift of the reference clocks of a normal digital audio recorder is approximately 0.1%. When 
sound source information digitally recorded by a digital video tape (DAT) recorder is overdubbed on video information 
35 already recorded on a DVD-RAIVI disc by digital copy, the reference clocks between the video information and audio 
information may have an error of around 0.1%. Such reference clock error becomes so large that it cannot be ignored 
upon repeating the digital copy (or nonlinear edit using a personal computer or the like), and appears as interrupted 
playback tones or a phase shift between playback channels. 

[0184] In an embodiment of the present invention, synchronization information can be recorded as an option to syn- 
40 chronously play back video information and audio information (or to attain phase synchronization among channels of 
multi-channel audio data) even when the reference clocks of audio information have shifted. 

[0185] More specifically, in the audio synchronization information in FIG. 9, the presence/absence of synchroniza- 
tion information between audio and video streams can be set in units of audio stream IDs (#1 , #2, ...). 
[0186] When this audio synchronization information is present, the number of audio samples is described in units 
45 of VOBUs in audio synchronization data in that information. Using this information (the number of audio samples), syn- 
chronization between video information and audio information or synchronization between channels of multi-channel 
audio data can be attained in units of VOBUs in each audio stream upon playback. 

[0187] FIG. 10 exemplifies the hierarchical structure of information included in control information DA21 in FIG. 4. 

[0188] Each cell in FIG. 5 or 6 indicates a playback period that designates playback data by its start and end 
50 addresses. On the other hand, program chain PGG in FIG. 5 is a series of playback execution units that designate the 
playback order of cells. Playback of video object set VOBS in FIG. 5 is determined by program chains PGG and cells 
that form video object set VOBS. 

[0189] AV data control information DA210 in FIG. 10 has control information PGGGI of such program chain PGG. 
This PGG control information PGGGI is made up of PGG information management information PGG_MAI, k (one or 
55 more) PGG information search pointers, and k (one or more) pieces of PGG information, the number of which is equal 
to that of the PGG information search pointers. 

[0190] PGG information management information PGG_MAI includes information indicating the number of PGGs. 
Each PGG information search pointer points to the head of each PGG information PGGI, and allows easy search for 
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corresponding PGC information PGCi. 

[0191] Each PGC information PGCI includes PGC general information and m pieces of cell playback information. 
This PGC general information includes a PGC playback time and the number of pieces of cell playback information. 
[0192] Problems posed when the position of a sector as a minimum unit of address has deviated from that of video 
5 object unit VOBU shown in FIG. 6 will be explained below with reference to FIG. 1 1 . 

[0193] When new information is recorded on a data change area in FIG. 11 or information there is updated, com- 
plicated processes which: 

1 ) play back an ECC block present at the start position of VOBU#g; 

10 2) deinterleave the ECC block; 

3) change information of a portion that pertains to the data change area in the ECC block; 

4) re-assign error correction codes in the ECC block; and 

5) overwrite changed information at the ECC block position are required. As a consequence, a continuous record- 
ing process in NTSC video recording that requires a frame rate of 30 frames per sec is disturbed. 

15 

[0194] Furthermore, when the surface of an information storage medium (DVD-RAM disc 1 0) has dust or scratches, 
a recording process is influenced more seriously by such dust or scratches than a playback process. 
[0195] l\yiore specifically, when dust or scratches are present near that position of an ECC block which includes a 
sector that is to undergo processes 1) to 5) above, VOBU#g has been played back without any problem so far, but an 
20 information defect is produced by a rewrite process of the ECC block including that sector, and it may become impos- 
sible to play back VOBU#g. 

[0196] Also, every time information is rewritten in a data change area which is not relevant to VOBU#g, the start 

position of VOBU#g is required. A phase change recording film used as the recording material of the DVD-RAM disc 
has a tendency that its characteristics deteriorate after repetitive recording and defects increase. Hence, it is preferable 

25 to minimize the number of times of rewrite of a portion which need not be rewritten (the start position of VOBU#g in FIG. 
11) (the number of times of rewrite can be recorded in control information rewrite count CIRWNs in FIG. 4). 
[0197] For these reasons, in order to guarantee a continuous recording process at a frame rate of 30 frames per 
sec, to minimize the number of times of rewrite of unwanted portions, and so forth, in the present invention, the VOBU 
recording unit is set to match an integer multiple of ECC block size (32 kbytes), as shown in FIG. 6 (note that 2 kbytes 

30 of a sector are used as a minimum unit of address). This process is called 32-kbyte align. 

[0198] To attain 32-kbyte align, i.e., to set each VOBU size to always match an integer multiple of 32 kbytes before 
and after data change, a dummy pack (FIG. 7) having an appropriate size is inserted into each VOBU. 
[0199] A method of setting an AV address number set based on the aforementioned condition (32-kbyte align that 
sets the recording unit to match an integer multiple of ECC block size) will be explained below compared to another log- 

35 ical block number assignment method. 

[0200] In order to allow easy conversion to logical block numbers used in the file system, losses or repetitions of 
numbers due to replace processes for defects produced on information storage medium 10 are avoided. 
[0201 ] Upon recording video information, a replace process is done for a defect on the information storage medium. 
At this time, the setting location of an AV address moves on information storage medium 10 as a result of this replace 

40 process. 

[0202] Let "AVA" be the AV address number, "LBN" be the logical block number, and "LBNav" be the logical block 
number at the AV file start position. Then, the logical block number and AV address number satisfy: 

AVA = (LBN - LBNav) - 16 

45 

[0203] Note that digits after the decimal point of the quotient obtained upon division by 1 6 are dropped. 
[0204] FIG. 12 shows a case wherein the 32-kbyte align is executed by inserting a dummy pack into a cell whose 
data has been changed after video recording. Then, the boundary positions of video object units VOBU in the cell match 
those of ECC blocks (32 kbytes) which form data in that cell. 
50 [0205] AS a result, upon rewriting data later, data can be overwritten in units of ECC blocks (ECC need not be re- 
encoded). In addition, when the AV address uses an ECC block made up of 1 6 sectors as a unit, address management 
is easy even when overwrite (insert edit or the like) is made after video recording. Since this overwrite is made irrespec- 
tive of VOBU#g who data has not changed, playback of data of VOBU#g never fails due to rewrite of the data change 
area. 

55 [0206] When each VOBU size matches an integer multiple of 32 kbytes before and after data change even when 
no dummy pack is inserted (an integer multiple of 32 kbytes also means that of 2 kbytes of a sector), no dummy pack 
need be added for the purpose of the 32-kbyte align. However, since the dummy pack can be used in addition to 32- 
kbyte align (e.g., as an auxiliary area for postrecording and the like), an appropriate number of dummy packs are pref- 
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erably inserted irrespective of 32-l^byte align. 

[0207] FIG. 13 shows an example of playback of cell data recorded on disc 1 0 shown in FIG. 1 . 
[0208] As shown in FIG. 1 3, playback data is designated by a playback period from cell A to cell F. A playback com- 
bination of these cells in each program chain (PGC) is defined by program chain information. 
5 [0209] FIG. 1 4 is a table for explaining an example of the relationship between cells that form playback data shown 
in FIG. 13, and program chain information (PGCI) (see FIG. 5). 

[0210] More specifically, PGC#1 comprised of three cells #1 to #3 designates cell playback in the order of cell A 
cell B cell C. PGC#2 comprised of three cells #1 to #3 designates cell playback in the order of cell D cell E cell 
F. Furthermore, PGC#3 comprised of five cells #1 to #5 designates cell playback in the order of cell E cell A cell 
10 D ^ cell B cell E. 

[0211] In FIGS. 13 and 14, PGC#1 exemplifies a continuous playback period from cell A to cell C, and PGC#2 
exemplifies an intermittent playback period from cell D to cell F. On the other hand, PGC#3 shows an example that 
allows discontinuous cell playback independently of the cell playback direction and repetitive playback (cells C and D). 

15 Method of Assuring Continuous Playback Condition ) 

[0212] An indispensable condition for video information is to guarantee continuity upon playback unlike conven- 
tional computer information. As information for guaranteeing continuous playback, neither a special flag nor a statement 
are required. Information that guarantees continuity upon playback can be recorded in PGC control information PGCCI 
20 shown in FIG. 4. More specifically, "information that guarantees continuity upon playback" can be inserted in the form 
of adding a predetermined condition to a PGC coupling method that couples cells. Insertion of the predetermined con- 
dition will be explained below. 

[0213] FIG. 1 5 is a schematic view of a playback system to explain continuity of a playback signal. 

[0214] Video information recorded on information storage medium 1 0 is read by optical head 202, and is temporar- 

25 ily saved in buffer memory (semiconductor memory) 219. The video information read out from this buffer memory 219 
is sent externally. The transmission rate of the video information sent from optical head 202 to buffer 21 9 will be referred 
to as a physical transmission rate (PTR) hereinafter. Also, the average value of the transmission rate of the video infor- 
mation externally transferred from buffer memory 219 is named a system transmission rate (STR). In general, physical 
and system transmission rates PTR and STR assume different values. 

30 [0215] In order to play back information recorded at different locations on information storage medium 10 in turn, 
access operation for moving the focused spot position of optical head 202 is required. Seek access for moving entire 
optical head 202 is made to attain large movement; jump access for moving only an objective lens (not shown) for focus- 
ing laser is made to attain a movement for a very small distance. 

[0216] FIG. 16 shows a change over time in video information amount temporarily saved in buffer memory 219 

35 upon externally transferring video information while making access control. 

[0217] In general, since system transmission rate STR is higher than physical transmission rate PTR, the video 
information amount temporarily saved in buffer memory 21 9 continues to increase during the period of a video informa- 
tion time. When the temporarily saved video information amount has reached the capacity of buffer memory 21 9, a play- 
back process by optical head 202 is intermittently done, and the video information amount temporarily saved in buffer 

40 memory 21 9 maintains the buffer memory capacity full state (corresponding to a flat top of the graph in the video infor- 
mation playback time in FIG. 16). 

[0218] When video information recorded at another location on information storage medium 10 is to be played back 
successively, an access process of optical head 202 is executed. 

[0219] Three different access periods of optical head 202, i.e., a seek access time, a jump access time, and a rota- 
45 tion wait time of information storage medium 10, are required, as shown in FIG. 16. In these periods, since no informa- 
tion is played back from information storage medium 10, physical transmission rate PTR during these periods is 
substantially "0". By contrast, since average system transmission rate of video information to be externally sent remains 
constant, the video information temporary saved amount in buffer memory 219 keeps on decreasing (a graph that 
declines to the right during the seek access time, jump access time, or rotation wait time in FIG. 16). 
50 [0220] Upon completion of access of optical head 202, when playback from information storage medium 1 0 restarts 
(smaller one of the hatched video information playback times in FIG. 16), the video information temporary saved 
amount in buffer memory 219 increases again. 

[0221 ] This increase slope is determined by the difference between the physical transmission rate and average sys- 
tem transmission rate, i.e., (physical transmission rate PTR) - (average system transmission rate STR). 
55 [0222] After that, when access is made again near the playback position on information storage medium 10, since 
it can be attained by only jump access, only the jump access time and rotation wait time are required (a graph that 
declines to the right in FIG. 16). 

[0223] A condition that allows continuous playback in the playback operation shown in FIG. 16 can be defined by 
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the "upper limit vaiue of an access count during a specific period". Tfiat is, tfie information contents of PGC control infor- 
mation PGCCI shown in FIG. 4, e.g., a cell combination shown in FIG. 14, are set so that the access count assumes a 
value equal to or smaller than the "upper limit value of an access count during a specific period". 
[0224] An access count condition that absolutely disables continuous playback will be explained below using FIG. 
17. 

[0225] When the access frequency is highest, the video information playback time is very short, as shown on the 
right side of the graph center in FIG. 17, and only the jump access time and rotation wait time successively appear. In 
such case, it is impossible to assure playback continuity irrespective of physical transmission rate PTR. 
[0226] Let BM be the size of buffer memory 219. Then, the temporary saved video information in buffer memory 
219 is exhausted within a period: 

BIVI/STR (= BM - STR) (01) 

and continuous playback is disabled. 

[0227] Let JATi (Jump Access Time of an objective lens) be each jump access time, and MWTi (Spindle Motor Wait 
Time) be each rotation wait time. Then, in the example shown in FIG. 17, we have: 

BM/STR = E (JAT i + MWTi) (02) 

[0228] Approximating equation (02), and letting JATa be the average jump access time, MWTa be the average rota- 
tion wait time, and n be the access count within the period until the temporary saved video information in buffer memory 
219 is exhausted, equation (02) can be rewritten as: 

BM/STR = n • (JATa + MWTa) (03) 

[0229] In this case, an indispensable condition as "access count n until temporary saved video information in buffer 
memory 219 is exhausted" which is an absolute condition for assuring continuous playback is: 

n < BM/(STR • (JATa + MWTa)) (04) 

[0230] The value given by inequality (04) is rewritten to access count N per sec as: 

N = n/(BM/STR) < 1/(JATa + MWTa) (05) 

[0231] Since average system transmission rate STR when MPEG2 is used is around 4 Mbps (bits per second), and 
the average rotation cycle of a 2.6-GB DVD-RAM single-sided, single-layered disc is approximately 35 ms (millisec- 
onds), average rotation wait time MWTa is MWTa = 1 8 ms. On the other hand, a general information recording/playback 
apparatus has JATa = 5 ms. 

[0232] As a practical example of size BM of the buffer memory 21 9, some drives with a larger size have 2 Mbytes 
= 16 Mbits, but the buffer memory sizes of most drives (information recording/playback apparatuses) is around 512 

kbytes = 4 Mbits in the status quo (in terms of the product cost). 

[0233] Upon computing using buffer memory size BM = 4 Mbits, the shortest time until the temporary saved video 
information in buffer memory 219 is exhausted is 4 Mbits/4 Mbps = 1 sec. Substituting this value in inequality (04) 
yields: 

n < BM/(STR • (JATa + MWTa)) = 1 sec/(18 ms + 5 ms) = 43 

[0234] A computation example under the specified condition yields the aforementioned result (access count upper 
limit n = 43). However, since the computation result changes depending on the buffer memory size and average system 
transmission rate of the apparatus, equation (03) serves as a condition formula required to assure continuous playback. 
[0235] When access is made at an access frequency slightly lower than that obtained by equation (03), and when 
physical transmission rate PTR is much higher than average system transmission rate STR, continuous playback is 
enabled. 

[0236] In order to allow continuous playback by satisfying only the condition of equation (03), the following prereq- 
uisites must be satisfied: 

1) physical transmission rate PTR is extremely high; and 

2) all pieces of video information to be accessed are allocated at nearby locations and can be accessed by only 
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jump access without any seek access. 

[0237] Hence, a condition that can guarantee continuous playback even when physical transmission rate PTR is 
relatively low will be examined below. 
5 [0238] As shown in FIG. 18, when the video information playback time and access times have good balance, and 
the temporary saved video information in buffer memory 219 is maintained nearly constant from a global point of view, 
continuity of video information playback viewed from an external system can be assured without exhausting temporary 
saved video information in buffer memory 219. 

[0239] Let SATi (Seek Access Time of an objective lens) be each seek access time, SATa be the average seek 
10 access time after n accesses, DRTi (Data Read Time) be the playback information read time per access, and DTRa be 
the average playback information read time after n accesses. 

[0240] Then, the data amount externally transferred from buffer memory 219 during the total access period of n 
accesses is given by: 

15 STR X (E (SATi + JATi + MWTi)) = STR x n x (SATa + JATa + MWTa) (06) 

[0241] When the value given by equation (06) and the video information amount: 

(PTR - STR) X EDRTi = (PTR - STR) x n • DRTa (07) 

20 

Stored in buffer memory 219 upon playing back video information by n accesses satisfy 
(PTR - STR) X n • DRTa ^ STR x n x (SATa + JATa + MWTa) , i.e., 

(PTR - STR) • DRTa ^ STR • (SATa + JATa + MWTa) (08) 

25 

continuity of playback video viewed from the external system side can be assured. 
[0242] If N represents the average access count per sec, we have: 

1 = N • (DRTa + SATa + JATa + MWTa) (09) 

30 

[0243] From formulas (08) and (09), since 

1/(N • (SATa + JATa + MWTa) ^ 1 + STR/(PTR - STR) 

35 solving this inequality for N yields: 

N ^ 1/{[1 + STR/(PTR - STR)] • (SATa + JATa + MWTa)} (10) 

[0244] N of this inequality (1 0) defines the access count upper limit value per sec that can assure continuity of play- 
40 back video. 

[0245] The relationship between the seek access distance and the seek access time required therefor will be exam- 
ined below. 

[0246] FIG. 1 9 is a view for explaining the relationship between the seek distance and seek time of an optical head. 
[0247] When a target position has been reached while accelerating/decelerating at equal acceleration a, the mov- 
45 ing distance until time tmax at which the moving speed of optical head 202 becomes maximum is a • tmax • tmax/2 
from FIG. 1 9. Hence, total distance p the optical head has moved by seek access is given by: 

p = a • tmax • tmax (11) 

50 [0248] As can be seen from equation (1 1 ), the time required for seek access is proportional to the 1/2-th power (i.e., 
a square root) of the moving distance. 

[0249] FIG. 20 is a view for explaining the average seek distance of the optical head. 

[0250] The average seek distance (average seek access distance) upon recording video information on an area 
having radial width L will be examined. As shown in FIG. 20, the average seek distance from a position distance XO from 
55 the end (of a seek area) to all recording areas is given by: 

X0X0/2L + (L - XO) • (L - X0)/2L (12) 
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[0251] When the average value upon moving XO from 0 to L is computed using formuia (12), the average seel^ dis- 
tance as a result of integrating XO under prescribed conditions is: 

US (13) 

[0252] A case will be examined below wherein an area corresponding to half the radial width on optical disc 10 

which corresponds to data area DA shown in FIG. 4 is used to record AV data area DA2. 

[0253] In this case, from formula (13) the average seek distance (average seek access distance) is 1/6 the radial 
width on optical disc 1 0 corresponding to data area DA. 

[0254] For example, when optical head 202 takes 0.5 sec to move (seek) from the innermost periphery to the out- 
ermost periphery of the recording area (data area DA in FIG. 4), from equation (11) the average seek time (average 
seek access time) within AV data area DA2 is: 

SATa = 200 ms (14) 
which is a value proportional to the 1/2-th power of 1/6 of 0.5 sec. 

[0255] For example, MWTa = 1 8 ms and JATa = 5 ms are used in computations, as described above. In such case, 
a 2.6-GB DVD-RAM disc has PTR = 1 1 .08 Mbps. When the average transmission rate of MPEG2 is STR = 4 IVIbps, if 
the aforementioned values are substituted in inequality (10), N ^ 2.9 is obtained. 

[0256] FIG. 21 is a schematic view of a recording system to explain continuity of a recording signal. 
[0257] Recording information is externally sent to buffer memory 219 at average system transmission rate STR 
(around 4 Mbps in MPEG2 video). Buffer memory 219 temporarily holds the sent information (MPEG video data and 
the like), and transfers the held information to optical head 202 at physical transmission rate PRT that matches the stor- 
age medium and the type of drive. 

[0258] In order to record the information at different locations on information storage medium 10 in turn, access 
operation that moves the focused spot position of optical head 202 is required. Seek access for moving entire optical 
head 202 is made to attain large movement; jump access for moving only an objective lens (not shown) for focusing 
laser is made to attain a movement for a very small distance. 

(Access Frequency Reduction Method; Re-arrange Cells by Editing) 

[0259] FIG. 22 exemplifies cells that form a portion of recorded AV data (video signal information), and video object 
unit VOBU sequences of the respective cells. 

[0260] FIG. 23 is a view for explaining a case wherein cell #2 is edited and data falls short in the middle of cell #2 
(at the position of VOBU 108e) in the sequence shown in FIG. 22 (VOBU 108e is re-encoded). 

[0261] Furthermore, FIG. 24 is a view for explaining changes in cell configuration, VOBU sequences, and position 
of a free area exemplified in FIG. 22 upon completion of edit in FIG. 23. 

[0262] In order to guarantee the seamless, continuous playback or recording, each cell layout in PGC information 
(FIGS. 10 and 14) in PGC control information PGCCI in FIG. 4 is set to satisfy the condition of formula (5) or (10). How- 
ever, when the access frequency becomes higher than a seamless guarantee value due to user requests in the edit 
operation, the access frequency reduction process is executed again to satisfy the conditions of formulas (03) or (08). 
[0263] This re-process will be explained below. 
[0264] Assume that the cell playback order: 

cell #1 cell #2 cell #3 

is initially set, as shown in FIG. 22 (in this case, no access occurs during playback). 

[0265] Then, the user divides cell #2 into cell #2A and cell #2B (FIG. 23), and sets the cell playback order: 

cell #2A cell #1 cell #2B cell #3 

In this case, the access count increases by two, that is: 

access from the end of cell #2A to the head of cell #1 ; and 
access from the end of cell #1 to the head of cell #2B. 

[0266] In this manner, when formula (03) or (08) cannot be satisfied as a result of an increase in access count in 
that PGC, cell #2A is moved to free area 107, as shown in FIG. 24. 
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[0267] As a result, the access count in the PGC that defines the playback order "ceil #2A cell #1 cell #2B 
cell #3" decreases to one, that is: 

access from the end of cell #1 to the head of cell #2B. 

5 

[0268] As in the above example, when formula (03) or (08) cannot be satisfied, some cells are moved (to change 
the recording location on information storage medium 1 0), thus lowering the access frequency. In this way, formula (03) 
or (08) is satisfied to guarantee seamless, continuous playback or recording in that PGC. 

[0269] When formula (03) or (08) is not satisfied even after an increase in access count due to editing is decreased 
10 by the aforementioned method, the user re-checks the cell configuration of the PGC to re-configure the number and 
sequence (layout) of cells in the PGC so as to satisfy formula (03) or (08). 

fulethod of Assuring Continuous Recording Condition) 

15 [0270] FIG. 25 is a graph for explaining an example of the relationship between access operations and the tempo- 
rary saved amount in the buffer memory upon continuous recording of a video signal (when the access frequency is 
highest). FIG. 26 is a graph for explaining another example of the relationship between access operations and the tem- 
porary saved amount in the buffer memory upon continuous recording of a video signal (when the recording time and 
access time have good balance). 

20 [0271] Unlike in the "case wherein continuous playback is disabled when the temporary saved video information 
amount on buffer memory 219 is exhausted" that has been explained with reference to FIG. 17, the temporary saved 
video information amount on buffer memory 219 is saturated upon continuous recording, as shown in FIG. 25. 
[0272] More specifically, as can be seen from a comparison between FIGS. 25 and 1 7, formula (03) can be applied 
to the access frequency that satisfies the continuous recording condition. 

25 [0273] Likewise, as can be seen from a comparison between FIGS. 26 and 18, formula (08) can be applied to the 
access frequency that satisfies the continuous recording condition. 

[0274] According to the "conditional formula for assuring continuity" that has been explained with reference to 

FIGS. 16 to 20 and FIGS. 25 and 26, seamless, continuous playback or recording (free from any interrupt during play- 
back or recording) can be guaranteed irrespective of the characteristics of an information recording/playback apparatus 
30 (drive) used. 

[0275] FIG. 27 is a block diagram for explaining the arrangement of a DVD video recorder which can cope with a 
synchronization error between video and audio upon re-arranging (e.g., editing) video information in a video object. 
[0276] The apparatus main body of the DVD video recorder shown in FIG. 27 is roughly constructed by disc drive 
32 for rotating DVD-RAM (DVD-RW) disc 10, and reading/writing information to/from disc 10, disc changer (or disc 
35 pack) 100 which automatically supplies predetermined disc 10 to disc drive 32, and can load a plurality of discs 10, 
encoder 50 on the video recording side, decoder 60 on the playback side, and main MPU 30 for controlling the opera- 
tions of the apparatus main body. 

[0277] Data processor 36 can have functions of supplying DVD recording data output from encoder 50 to disc drive 
32, receiving a DVD playback signal played back from disc 10 via drive 32, rewriting management information recorded 

40 on disc 10, and erasing data recorded on disc 10, under the control of main MPU 30. 

[0278] Also, data processor 36 forms ECC groups by combining packs sent from formatter 56 in units of 1 6 packs, 
appends error correction information to each ECC group, and sends these ECC groups to disc drive 32. In this case, 
when disc drive 32 is not ready to record on disc 10, ECC group data appended with error correction information are 
transferred to temporary storage 34, and are temporarily stored therein until drive 32 is ready to record. When disc drive 

45 32 is ready to record, recording of data stored in temporary storage 34 on disc 1 0 starts. 

[0279] Main MPU 30 includes a ROM written with control programs and the like, a RAM that provides a work area 
required for executing a program, an audio information synchronization processor, a telephone l/F or Internet l/F, and 
the like. 

[0280] MPU 30 executes an audio information synchronization process (to be described later; FIG. 29) and other 
50 processes using its RAM as a work area in accordance with the control programs stored in its ROM. 

[0281] Of the execution results of main MPU 30, the contents that the DVD video recorder user is informed of are 
displayed on a display unit (not shown) of the DVD video recorder, or are displayed on a monitor display (not shown) in 
an on-screen display (OSD) mode. 

[0282] The information recording/playback apparatus portion that writes/reads (records and/or plays back) informa- 
55 tion to/from DVD disc 1 0 comprises disc changer (disc pack) 100, disc drive 32, temporary storage 34, data processor 
36, and system time counter (or system time clock; STC) 38. 

[0283] Temporary storage 34 is used to buffer a predetermined amount of data of those to be written in disc 1 0 via 
disc drive 32 (i.e., data output from encoder 50), and to buffer a predetermined amount of data of those played back 



21 



EP 1 065 665 A1 



from disc 1 0 via disc drive 32 (i.e., data input to decoder 60). in tliis sense, temporary storage 34 in FiG. 27 iias a func- 
tion corresponding to buffer memory 219 in FIG. 21. 

[0284] For example, when temporary storage 34 is comprised of a semiconductor memory (DRAM) of 4 to 8 
Mbytes, it can buffer recording or playback data for approximately 8 to 16 sec at an average recording rate of 4 Mbps. 
On the other hand, when temporary storage 34 Is comprised of a 16-Mbyte EEPROM (flash memory), it can buffer 
recording or playbaci< data for approximately 32 sec at an average recording rate of 4 Mbps. Furthermore, when tem- 
porary storage 34 is comprised of a 1 00-Mbyte very compact HDD (hard disc), it can buffer recording or piaybacl^ data 
for 3 min or more at an average recording rate of 4 Mbps. 

[0285] When the DVD video recorder has an external card slot (not shown in FiG. 27), the EEPROM may be sold 
as an optional iC card. On the other hand, when the DVD video recorder has an external drive slot or SCSI interface, 
the HDD can be sold as an optional expansion drive. 

[0286] In this connection, in an embodiment (not shown) in which a DVD video recorder is implemented by software 
using personal computer PC, the free space of a hard disc drive or a main memory of PC itself can be partially used as 
temporary storage 34 in FIG. 27. 

[0287] Temporary storage 34 can also be used to temporarily store recording information until disc 1 0 is exchanged 
by a new one, when disc 10 has been fully recorded during video recording, in addition to the aforementioned purpose 
of guaranteeing "seamless, continuous playback or recording". 

[0288] Furthermore, when disc drive 32 uses a high-speed drive (double-speed or higher), temporary storage 34 
can be used to temporarily store data that excesses data to be read out from a normal drive within a predetermined 
period of time. 

[0289] When read data upon playback is buffered on temporary storage 34, even when an optical pickup (not 
shown) produces read errors due to a vibration shock or the like, playback data buffered on temporary storage 34 can 

be used instead, thus preventing the played back picture from being interrupted. 

[0290] As an analog signal source of a raw signal to be recorded on disc 1 0, a video playback signal of VHS video, 
laser disc LD, or the like is available, and this analog video signal is input to encoder 50 via an AV input shown in FIG. 
27. 

[0291] As another analog signal source, normal analog TV broadcast (ground or satellite broadcast) is available, 

and this analog TV signal is input to encoder 50 from a TV tuner shown in FIG. 27 (in case of TV, text information such 
as closed caption or the like is often broadcasted simultaneously with video information, and such text information is 
also input to encoder 50). 

[0292] As a digital signal source of a raw signal to be recorded on disc 10, a digital output or the like of a digital 
broadcast tuner is available, and this digital video signal is directly input to encoder 50. 

[0293] When this digital tuner has an IEEE1394 interface or SCSI interface, its signal line is connected to main 
MPU 30. 

[0294] When a bitstream (including MPEG-encoded video) of DVD video is digitally broadcasted directly, and the 
digital tuner has a digital output of the bitstream, since the bitstream output has already been encoded, it is directly 
transferred to data processor 36. 

[0295] Note that the analog video output of a digital device which does not have any digital video output but has 
digital audio output (e.g., digital video cassette DVC or digital VHS video DVHS) is connected to the AV input, and its 
digital audio output is supplied to encoder 50 via sample rate converter SRC. This SRC converts a digital audio signal 

having a sampling frequency of, e.g., 44.1 kHz into that having a sampling frequency of 48 kHz. 

[0296] Although no signal lines are illustrated in FIG. 27, when personal computer PC can output a digital video sig- 
nal in the DVD video format, that digital video signal can be directly input to encoder 50. 

[0297] All digital input audio signal sources (digital tuner, DVC, DVHS, PC, and the like) are connected to main MPU 
30. This is done to use such signals in the "audio synchronization process" to be described later. 
[0298] The control timings that main MPU 30 controls disc changer (disc pack) 100, disc drive 32, data processor 
36, and encoder 50 and/or decoder 60 can be determined based on time data output from STC 38 (video record- 
ing/playback operations are normally done in synchronism with time clocks from STC 38, but other processes may be 
executed at timings independently of STC 38). 

[0299] A DVD digital playback signal which is played back from disc 1 0 via disc drive 32 is input to decoder 60 via 
data processor 36. 

[0300] As will be described in detail later using FIG. 28, decoder 60 includes a video decoder for decoding a main 
picture video signal from the input DVD digital playback signal, a sub-picture decoder for playing back a sub-picture sig- 
nal from this playback signal, an audio decoder for playing back an audio signal from this playback signal, a video proc- 
essor for compositing the decoded sub-picture on the decoded main picture, and a means (reference clock generator) 
for correcting timing errors between video and audio signals or among channels of a multi-channel audio signal. 
[0301] A video signal (main picture + sub-picture) decoded by decoder 60 is supplied to video mixer 602. Video 
mixer 602 receives reduced-scale picture/thumbnail picture data (see FIG. 4) and text data from main MPU 30 as 
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needed. This thumbnail picture (and/or text) are/is composited on the decoded video signai on frame memory 604 to 
generate a visual menu (user menu) used in a recorded content search and the like. 

[0302] When thumbnail pictures for the user menu are displayed on a monitor (not shown), a thumbnail picture file 
previously saved as an independent file is flowed as stream packs, and is displayed by designating display positions (X- 
and Y-coordinate values) in frame memory 604. At this time, if text data or the like is included, text is displayed below 
each thumbnail picture using a character ROM (or kanji ROIVI). 

[0303] A digital video signal including this visual menu (user menu) as needed is output outside the apparatus 
shown in FIG. 27 via a digital video l/R Also, the digital video signal including the visual menu as needed is converted 
into an analog video signal via a video DAC, and the analog video signal is sent to an external analog monitor (a TV 
with an AV input). 

[0304] Note that thumbnail picture data for the user menu may be inserted into recording data as independent video 
pack data in place of the aforementioned independent file. That is, the DVD video format specifies "0" (stream ID = 
OEOh) as a stream number for main picture data, and can also specify "1 " (stream ID = 0E1 h) as that for thumbnail pic- 
ture data and multiplex such stream. The multiplexed thumbnail pictures with the stream number = "1" serve as source 
data used in a menu edit process. 

[0305] FIG. 28 is a block diagram for explaining the internal arrangement of the encoder and decoder in the 
arrangement shown in FIG. 27. 

[0306] Encoder 50 comprises ADC (analog-to-digital converter) 52, video encoder 53, audio encoder 54, sub-pic- 
ture encoder 55, formatter 56, buffer memory 57, frame memory 51 for thumbnail pictures, thumbnail video encoder 58, 
and memory 59 used upon encoding thumbnail pictures. 

[0307] ADC 52 receives an external analog video signal + external analog audio signal from the AV input in FIG. 
27, or an analog TV signal + analog audio signal from the TV tuner. ADC 52 converts the input analog video signal into 

a digital signal at a sampling frequency of, e.g., 13.5 MHz/6.75 MHz and 8 quantization bits. 

[0308] That is, luminance component Y is converted into digital data at a sampling frequency of 13.5 MHz and 8 
quantization bits, and color difference components Cr (or Y-R) and Cb (or Y-B) are respectively converted into digital 
data at a sampling frequency of 6.75 MHz and 8 quantization bits. 

[0309] Similarly, ADC 52 converts the input analog audio signal into a digital signal at a sampling frequency of, e.g., 
48 kHz and 16 quantization bits. 

[0310] When an analog video signal and digital audio signal are input to ADC 52, the digital audio signal passes 
through ADC 52. (The digital audio signal may undergo processes for reducing jitter alone, changing the sampling rate 
or the number of quantization bits, and the like without changing its contents.) 

[0311] On the other hand, when a digital video signal and digital audio signal are input to ADC 52, these signals 
pass through ADC 52 (these signals may also undergo a jitter reduction, sampling rate change process, and the like 
without changing their contents). 

[0312] A digital video signal component output from ADC 52 is supplied to formatter 56 via video encoder 53. Also, 
a digital audio signal component output from ADC 52 is supplied to formatter 56 via audio encoder 54. 
[0313] Video encoder 53 has a function of converting the input digital video signal into a digital signal compressed 
at a variable bit rate by MPEG2 or MPEG1. 

[0314] Audio encoder 54 has a function of converting the input digital audio signal into a digital signal compressed 
at a fixed bit rate (or linear PCM digital signal) by MPEG or AC-3. 

[0315] When a DVD video signal is input from the AV input or when a DVD video signal (digital bitstream) is broad- 
casted and received by TV tuner 44, a sub-picture signal component (sub-picture pack) in the DVD video signal is sent 
to sub-picture encoder 55. Alternatively, if a DVD video player with a sub-picture signal independent output terminal is 
available, a sub-picture signal component can be extracted from that sub-picture output terminal. Sub-picture data input 
to sub-picture encoder 55 is arranged into a predetermined signal format, and is then supplied to formatter 56. 
[0316] Formatter 56 performs predetermined signal processes for the input video signal, audio signal, sub-picture 
signal, and the like while using buffer memory 57 as a work area, and outputs recording data that matches a predeter- 
mined format (file structure) to data processor 36. 

[0317] The respective encoders (53 to 55) compress and packetize the input signals (video, audio, and sub-pic- 
ture). (Note that packets are segmented and packetized to have a size of 2,048 bytes per pack.) These compressed 
signals are input to formatter 56. Formatter 56 determines and records presentation time stamp PTS and decoding time 
stamp DTS of each packet in accordance with the timer value from STC 38 as needed. 

[0318] In this case, packets of thumbnail pictures used in the user menu are transferred to and temporarily saved 
in memory 59 for storing thumbnail pictures. The thumbnail picture packet data is recorded as an independent file upon 
completion of video recording. The size of each thumbnail picture on the user menu is selected to be, e.g., approxi- 
mately 144 pixels X 96 pixels. 

[0319] Note that MPEG2 compression, which is the same as the compression format of main picture data, can be 
used as that of thumbnail pictures, but other compression schemes may be used. For example, other compression 
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schemes such as JPEG compression, runlength compression (pallet = 256 colors: requires a reduction to 256 colors), 
TIFF format, PICT format, and the like can be used. 

[0320] Formatter 56 temporarily saves packet data in buffer memory 57, then packs the input packet data to mix 
them in units of GOPs of MPEG, and transfers the packs to data processor 36. 

[0321] The contents of standard encoding for generating the recording data to be transferred to data processor 36 
will be briefly explained below. 

[0322] When encoder 50 starts encoding, parameters required for encoding video (main picture) data and audio 
data are set. The main picture data is pre-encoded using the set parameters to compute an optimal code amount dis- 
tribution to a predetermined average transmission rate (recording rate). Based on the code amount distribution obtained 
by pre-encoding, the main picture data is encoded. At this time, the audio data is encoded at the same time. 
[0323] As a result of pre-encoding, when data compression is insufficient (when a desired video program cannot 
be stored in a DVD-RAM or DVD-R disc used to record data), if pre-encoding can be done again (for example, if the 
recording source is the one capable of read many such as a video tape, video disc, or the like), the main picture data is 
partially re-encoded, and the re-encoded main picture data portion replaces the previously pre-encoded main picture 
data portion. With a series of such processes, the main picture data and audio data are encoded, and the average bit 
rate value required for recording is reduced largely. 

[0324] Likewise, parameters required for encoding the sub-picture data are set, and encoded sub-picture data is 
generated. 

[0325] The encoded main picture data, audio data, and sub-picture data are combined and converted into a data 
structure for video recording. 

[0326] That is, the configuration of cells that construct program chain PGC shown in FIG. 5 or 14, attributes of main 
picture, sub-picture, and audio data, and the like are set (some pieces of such attribute information use information 
obtained upon encoding the individual data), and information management table information containing various kinds 

of information is created. 

[0327] The encoded main picture data, audio data, and sub-picture data are segmented into packs each having a 
predetermined size (2,048 bytes), as shown in FIG. 6. Dummy packs (FIG. 7) are inserted into these packs as needed 
to implement the aforementioned "32-kbyte align". 

[0328] Packs other than the dummy packs describe time stamps such as a PTS (presentation time stamp; see FIG. 
6), DTS (decoding time stamp), and the like as needed. As for the PTS of sub-picture data, a time arbitrarily delayed 
from that of main picture data or audio data in the same playback time zone can be described. 

[0329] The data cells are arranged in units of VOBUs so as to play back data in the order of their time codes, thus 
formatting a VOBS constructed by a plurality of cells, as shown in FIG. 5, as video object DA22. 

[0330] When a DVD playback signal is digitally copied from a DVD video player, since the contents of cells, program 
chain, management tables, time stamps, and the like are predetermined, they need not be generated again. (When a 
DVD video recorder is designed to digitally copy a DVD playback signal, copyright protection means such as digital 
watermarking or the like must be taken.) 

[0331] Decoder 60 in FIG. 28 comprises: reference clock generator 61 for generating sync-locked reference clocks 
on the basis of audio synchronization signal A-SYNC sent from main MPU 30 in FIG. 27; separator 62 for separating 
and extracting packs from playback data with the structure shown in FIG. 6; memory 63 used upon signal processes 
such as pack separation and the like; video decoder 64 for decoding main picture data (the contents of a video pack) 
separated by separator 62; sub-picture decoder 65 for decoding sub-picture data (the contents of a sub-picture pack) 
separated by separator 62; video processor 66 for compositing sub-picture data output from sub-picture decoder 65 
with video data output from video decoder 64, as needed, and outputting main picture data with superimposed sub-pic- 
ture data such as menus, highlight buttons, superimposed dialog, and the like; audio decoder 68 for decoding audio 
data (the contents of an audio pack) separated by separator 62 at the timing of the reference clock from reference clock 
generator 61; a digital audio l/F for externally outputting a digital audio signal from audio decoder 68; and a DAC for 
converting the digital audio signal from audio decoder 68 into an analog audio signal, and externally outputting the ana- 
log audio signal. 

[0332] The analog audio signal from this DAC is supplied to external components (not shown; a multichannel ster- 
eophonic apparatus having two to six channels). 

[0333] Note that audio synchronization signal A-SYNC is used to synchronize audio signals in units of, e.g., VOBUs 
shown in FIG. 6. 

[0334] Main MPU 30 in FIG. 27 can generate audio synchronization signal A-SYNC by detecting audio synchroni- 
zation packs, when a digital audio signal sent from a digital input device includes the configuration shown in FIG. 6 and 
the audio synchronization packs (SNV_PCK; not shown) are inserted at the head positions of respective VOBUs. 
[0335] Alternatively, main MPU 30 in FIG. 27 can generate audio synchronization signal A-SYNC using PTS infor- 
mation obtained by detecting presentation time stamps PTS (FIG. 6) included in audio packs. 
[0336] In the arrangement shown in FIGS. 27 and 28, the data processes upon playback are done as follows. 
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[0337] Upon receiving a playbacl^ start command by user's operation, main IVIPU 30 loads tine management area 
of disc 10 from disc drive 32 via data processor 36, and determines tine address to be played back (corresponding to 
tine address using common logical sector number LSN). 

[0338] Main MPU 30 then sends the previously determined address of playback data and a read command to disc 
drive 32. 

[0339] An MPU (not shown) in disc drive 32 reads out sector data from disc 10 in accordance with the received 
command, and data processor 36 makes error correction of the readout data and sends the data to decoder 60 in the 
form of pack data. 

[0340] In decoder 60, the readout pack data are packetized. Then, video packet data (MPEG video data) is trans- 
ferred to video decoder 64, audio packet data to audio decoder 68, and sub-picture packet data to sub-picture decoder 

65 in correspondence with the data purposes. 

[0341] At the beginning of transfer of these packet data, presentation time stamp PTS is loaded to STC 38. After 
that, the respective decoders in decoder 60 execute playback processes in synchronism with the PTS value in packet 
data (while comparing PTS and STC values), thus displaying a moving picture with audio and superimposed dialog on 

a TV monitor (not shown). 

[0342] By setting the aforementioned AV address, video information in a plurality of DVD-ROM and/or DVD-RAM 
discs inserted into a multiple disc pack (disc changer 1 00 in FIG. 27) can be loaded as a part of an AV file. 
[0343] In a DVD video (DVD-ROM) disc, the recording location of a video object is set by a logical block number (or 
a logical or physical sector number) as a file entry. In this case, when address conversion table ACT shown in FIG. 4 is 
used, this logical block number can be converted into the AV address. This address conversion table ACT describes 
pairs of logical block numbers and AV addresses on a table. 

[0344] FIG. 29 is a flow chart for explaining a synchronization process between video and audio in the DVD video 
recorder shown in FIG. 27. 

[0345] A video signal input from the AV input such as a TV tuner, VTR, camera recorder, or the like is converted 
into a digital signal by ADC 52 (step ST200). 

[0346] The converted digital signal is separated into video information and audio information, which are individually 
encoded by video encoder 53 and audio encoder 54. Closed caption information or information sent as superimposed 
text of teletext is encoded by sub-picture encoder 55 as sub-picture data. The encoded video information, audio infor- 
mation, and sub-picture information are respectively inserted in video, audio, and sub-picture packs in units of 2,048 
bytes by formatter 56, and these packs are arranged in units of VOBUs having a size corresponding to an integer mul- 
tiple of 32 kbytes, as shown in FIG. 6 (step ST202). 

[0347] At this time, formatter 56 extracts information indicating "the sample position of the audio information sample 

position at the l-picture display start time at the head of a VOBU and how many packs this audio pack goes back (or 
ahead) with reference to the position of a video pack" (step ST204A). 

[0348] The extracted audio information sample position information is sent to main MPU 30 in FIG. 27. 

[0349] The audio information synchronization processor in main MPU 30 sends back to formatter 56 a signal for 
generating presentation time stamp PTS or synchronization navigation pack SNV_PCK (not shown) as a source of 
audio synchronization signal A-SYNC on the basis of the received audio information sample position information. 
[0350] Formatter 56 sends VOBU information shown in FIG. 6, which includes the source information (PTS or 
SNV_PCK) of audio synchronization signal A-SYNC together with the encoded video information, sub-picture informa- 
tion, and audio information, to data processor 36. Parallel to "audio information sample position information extraction 
step ST204A" which is then repeated, data processor 36 records video object DA22 consisting of VOBU information 
shown in FIG. 6 at the designated address (AV address) of disc 10 (step ST204B). 

[0351] As the recording progresses, disc drive 32 returns address information (logical sector number LSN) used in 
recording to main MPU 30. Main MPU 30 computes the recording location on disc 10 (e.g., the physical sector number 
PSN position on disc 10 of an audio information sample at the l-picture display start position at the head position of a 
given recorded VOBU) on the basis of the returned address information and the correspondence between the prede- 
termined and sector. This computation result is used in step ST208 later. 

[0352] The recording location on disc 10 (the physical sector number PSN position on disc 10 of an audio informa- 
tion sample at the l-picture display start position at the head position of a given VOBU) corresponds to "l-picture audio 
positions #1, #2, ..." included in audio synchronization information shown in FIG. 9. That is, the differential address 
value of an ECC block that includes an audio pack of the same time as the l-picture audio position l-picture start time 
shown in FIG. 9 is recorded using 1 byte. Of 1 byte, whether the audio sample position is located before or after the 
head position of a given VOBU is identified using the most significant bit. More specifically. 

Most significant bit = 0: located before VOBU 
Most significant bit = 1 : located after VOBU 
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[0353] Recording of video object DA22 on disc 10 proceeds until a recording end input is detected (for example, 
until the user instructs to stop recording or until the free area of disc 10 is used up) (NO in step ST206; ST200 to 
ST204A/ST204B). 

[0354] If the recording end input is detected (YES in step ST206), information that pertains to recording such as the 
recording end address (physical sector number PSN on disc 10), the recording date/time, and the like is written in the 
management area (control information DA21 ) on disc 1 0 (step ST208). At this time, upon writing information in the man- 
agement area, control information rewrite count CIRWNs shown in FIG. 4 is incremented by 1 . 

[0355] Note that a value that counts the sample number in an ECC block at the audio sample position of the same 
time as the l-picture start time using serial numbers of all audio packs is written in the management area (control infor- 
mation DA21) as "l-picture start audio sample numbers #1, #2, ..." included in the audio synchronization information 

shown in FIG. 9 (step ST208). 

[0356] Note that the recording location on disc 1 0 need not always be expressed by the AV address in units of ECC 
blocks (16 sectors). The "recording location on disc 10" can also be expressed using the logical block number, logical 
sector number, or physical sector number as the AV address. 

tdit Process of Cell including Audio Synchronization Information in FIG. 9) 

[0357] A case will be examined below wherein cell #2 In information recorded on disc 1 0 in the order of cell #1 , cell 
#2, and cell #3, as shown in FIG. 22, is divided into cells #2A and #2B, as shown in FIG. 23, cell #2A is moved to free 
area 91, as shown in FIG. 24, and re-arranged cells are played back in the order of: 

cell #2A ^ cell #1 cell #2B cell #3 

[0358] In this case, VOBU 108e is re-encoded, and is divided into VOBUs 108p and 108q. The audio information 
synchronization processor in main MRU 30 searches for the position of an audio pack included in cell #2A to be moved 
on the basis of the l-picture audio position (FIG. 9) and l-picture audio sample number (FIG. 9) from disc 10. 
[0359] If the audio pack included in cell #2 is present in either VOBU 108e or 108q, the corresponding audio pack 

is extracted therefrom, and is embedded in VOBU 108d* or 108p. 

[0360] In this case, if that VOBU has an extra dummy pack (having no significant recording data), the audio pack is 
embedded in such dummy pack. If no such dummy pack is available, re-arrangement of the format and re-encoding in 
some cases are done. 

[0361] On the other hand, when cell #2A includes an audio pack used by VOBU 108c or 108f, the corresponding 
audio pack is copied from cell #2A, and is inserted (embedded) in VOBU 108c or 108f. At this time, the insertion 
(embedding) result is recorded in the l-picture audio position and l-picture start audio sample number (FIG. 9). A series 
of operation control processes are mainly executed by the audio information synchronization processor in main MRU 
30 in FIG. 27. 

[0362] A case will be explained below wherein existing audio information from a digital audio information storage 
medium such as a CD, MD, or the like is overdubbed as background music on video information after playback/editing 
mentioned above. 

[0363] A method of overdubbing audio information includes a method of replacing dummy packs in FIGS. 6 and 7 

by audio packs, and a method of re-encoding audio information to be overdubbed. 

[0364] In some cases, the sampling frequency (32 kHz or 44.1 kHz) of audio information is different from that (48 
kHz or 96 kHz) of audio information in the recorded video information. Even when the nominal frequency remains the 
same, the frequency drift (fluctuation of frequency) of a quartz oscillator that generates the reference frequency is nor- 
mally around ±0.1%. Therefore, when digital audio information is digitally dubbed, recording is done at different refer- 
ence frequencies. As a consequence, when data is played back at the frequency of originally recorded audio frequency, 
a synchronization error occurs. 

[0365] In order to prevent such error, in the present invention, the number of audio samples in units of VOBUs for 
the digitally dubbed audio information can be recorded as an option in the management area (control information DA21 
in FIG. 4). 

[0366] More specifically, as shown in audio synchronization flags #1, #2, ... in FIG. 9, a flag indicating whether or 
not audio synchronization data is recorded is set for each audio stream number, and when the audio synchronization 
data is to be recorded (flag is set), the number of audio samples of each VOBU is expressed by 2 bytes in the audio 
synchronization information in FIG. 9. 

[0367] This audio synchronization information can be recorded as follows. 

[0368] Audio information to be overdubbed is converted into audio packs in units of 2,048 bytes by formatter 56 in 
FIG. 28. At this time, the audio information synchronization processor in main MRU 30 in FIG. 27 sends information as 
to required times in units of VOBUs of the video information of interest. Based on that time information, formatter 56 
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replies the numbers of audio samples in units of VOBUs to the audio information synchronization processor. 

[0369] Then, audio packs including the audio information to be overdubbed are replaced by dummy packs, thus 

completing video object DA22. 

[0370] After that, based on the numbers of audio samples in units of VOBUs replied from formatter 56 to main MPU 
30, the audio information synchronization processor records required information in the audio synchronization informa- 
tion on disc 10. 

[0371] Upon playback, the audio information synchronization processor in main MPU 30 reads the audio synchro- 
nization information on disc 10, and sends the numbers of audio samples in units of VOBUs to reference clock gener- 
ator 61 in the form of "audio synchronization signal A-SYNC". Reference clock generator 61 generates reference clocks 
with the frequency adjusted (sync-locked) to that information (A-SYNC), and audio decoder 68 plays back post-inserted 
audio information (overdubbed audio information) in synchronism with video information in correspondence with the fre- 
quency of the generated reference clocks. 

[0372] In this manner, audio playback free from any synchronization errors from video information can be imple- 
mented. 

[0373] In the above description, the numbers of audio samples are recorded in units of VOBUs. However, the 
present invention is not limited to this, and the numbers of audio samples may be recorded in units of cells or in units 
of video frames (or video fields). 

[0374] According to the aforementioned embodiment, the following effects are obtained: 

A) video information can be re-arranged while guaranteeing synchronization of an audio signal; 

B) even when digital audio information generated at a sample frequency different from that of an original is 
recorded in dummy packs or the like by a digital dubbing process after video recording, audio information can be 

synchronously played back; and 

c) even when multi-channel audio information of, e.g., AC-3 is re-arranged or mix-down edit from digital sources 
having different sampling frequencies is done, synchronization among channels can be guaranteed. 

[0375] In the above description, a DVD- RAM disc has been exemplified as an information storage medium. Alter- 
natively, the system of the present invention (especially, a system that performs address management and replace 
processes in units of 32-kbyte EGG blocks; or a system that performs address management and replace processes in 
units of 2-kbyte sectors) can be applied to a system which uses a file allocation table (FAT) for a personal computer in 
a file system using a magnetooptical disc (MO disc) as an information storage medium. 

[0376] As system software (or operating system), NTFS (New Technology File System), UNIX, and the like can be 
used in addition to MS Windows. More specifically, required system software (one or a plurality of kinds of operating 
systems OS), application software, and the like are recorded as an embossed pattern on ROM layer 17A in a 
ROM/RAM double-layered disc, the OS and directory information of ROM layer 17A are copied to a main memory of a 
personal computer in a recording/playback process, and the application software stored in ROM layer 17A can be 
directly used. In this case, since the application software need not be mapped on the main memory, the main memory 
space can be broadened. In such personal computer system, RAM layer 17B of identical disc 10 can be used as a 
large-size storage medium for saving the operation result (edited video and the like) of the application software in ROM 
layer 1 7A. 

[0377] Furthermore, the AV addresses in units of EGG blocks have been explained as the addresses of the AV data 
structure. Alternatively, the addresses of AV data can be managed using, e.g., addresses in units of 2,048-byte sectors. 
[0378] FIG. 30 is a view for explaining another example of the hierarchical structure of information recorded on the 
optical disc shown in FIG. 1 . 

[0379] The recorded contents of data area DA correspond to those shown in FIG. 4 that has already been 
explained. 

[0380] More specifically, the recorded contents of audio/video data area DA2 in FIG. 30 correspond to those of 
audio/video data area DA2 shown in FIG. 4 as follows: 

navigation data (RTR_VMG) DA21a in FIG. 30 ... control information DA21 in FIG. 4; 

movie video object (RTR_MOV.VOB) DA22a in FIG. 30 ... video object DA22 in FIG. 4; 

still picture video object (RTR_STO.VOB) DA23a in FIG. 30 ... picture object DA23 in FIG. 4; 

additional audio object (RTR_STA.VOB) DA24a for still pictures in FIG. 30 ... audio object DA24 in FIG. 4; 

manufacturer specification object (MSRVOB) DA25a in FIG. 30 ... not shown in FIG. 4; and 

another stream object (AST.SOB) DA26a in FIG. 30 ... not shown in FIG. 4. 

[0381] Note that RTR is an abbreviation for real-time recording. 

[0382] Navigation data (RTR_VMG) DA21a is used upon controlling recording, playback, and editing of an AV 
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stream (one or more video objects VOBs). This RTR_VI\/IG lias all required navigation data as well as a single manage- 
ment information file called RTR.IFO. 

[0383] More specifically, navigation data (RTR_VI\/1G) DA21a includes RTR video management information 
(RTR_VMGI) DA210a, movie AV file information table (M_AVFIT) DA210b, still picture AV file information table 
5 (S_AVFIT) DA210C, original PGC information (ORG_PGCI) DA210d, user-defined PGC information table (UD_PGCI) 
DA210e, text data manager (TXTDT_MG) DA210f, and manufacturer information table (MN_FIT) DA210g. 
[0384] Those pieces of information (DA210a to DA210g) are successively recorded in file RTR.IFO in the afore- 
mentioned order. 

[0385] Most of information described in this file RTR.IFO is stored in a system memory (work RAM in MPU 30 in 

10 FIG. 27). 

[0386] RTR_VMGI/DA21 Oa describes basic information (information similar to video manager information VMGI in 
a DVD video ROM) of an RTR disc (disc 10 in FIG. 1). 

[0387] M_AVFIT_SA/DA210b describes a movie AV file corresponding to RTR_MOV.VRO in FIG. 35 (VRO is an 
abbreviation for a video recorder object). 
15 [0388] In correspondence with AV data control information DA210 in control information DA21 in FIG. 4, navigation 
data DA21a in FIG. 30 includes movie AV file information table (M_AVFIT) DA210b. 

[0389] This movie AV file information table (M_AVFIT) DA21 Ob includes movie AV file information table information 
(M_AVFITI) DA2100, one or more pieces of movie VOB stream information (M_VOB_STI#1 to M_VOB_STI#n) 
DA2102-1 to DA2102-n, and movie AVfile information (M_AVFI) DA2104. 

20 [0390] M_AVFI/DA21 04 describes information of a movie AV file having a file name "RTR_MOV.VRO". 

[0391] This movie AV file information (M_AVFI) DA2104 includes movie AV file information general information 
(M_AVFI_GI) DA21040, one or more movie VOB information search pointer #1 to #n (M_VOBI_SRP#1 to 
M_VOBI_SRP#n) DA21 042-1 to DA21042-n, and one or more pieces of movie VOB information #1 to #n (M_VOBI#1 
to M_VOBI#n) DA21 044-1 to DA21044-n. 

25 [0392] n pieces of M_VOBI in M_AVFI/DA2104 are described in the same order as that of VOB data stored in the 
movie AV file. 

[0393] Each movie VOB information (e.g., M_VOBI#n/DA21044-n) includes movie VOB general information 

M_VOBI_GI and time map information TMAPI. 

[0394] FIG. 31 exemplifies the contents of time map information TMAPI in FIG. 30, and also the correspondence 
30 between these contents and AV data control information DA210 in FIG. 4. 

[0395] Time map information TMAPI is used upon executing special playback (e.g., cell playback in the order 
unique to each user using user defined PGC) and time search. 

[0396] Time map information TMAPI includes time map general information TMAP_GI, one or more time entries 
TM_ENT#1 to TM_ENT#r, and one or more VOBU entries VOBU_ENT#1 to VOBU_ENT#q. 
35 [0397] Each VOBU entry contains information of the size and playback time of each VOBU. The VOBU size is pre- 
sented in units of sectors (2 kbytes), and the playback time is presented in units of video fields (one field = 1/60 sec in 
NTSC; one field = 1/50 sec in PAL). 

[0398] Since the VOBU size is presented in units of sectors, as described above, VOBUs can be accessed using 
addresses in units of sectors. 

40 [0399] On the other hand, each time entry contains address information of the corresponding VOBU, and time dif- 
ference information. This time difference information indicates the difference between the playback time designated by 
the time entry and the VOBU playback start time. 

[0400] Assuming that the time interval (time unit TMU) between two successive time entries is 10 sec, this time 
entry interval corresponds to 600 fields in, e.g., NTSC video. 
45 [0401] Each VOBU entry, e.g., VOBU entry #1, includes reference picture size information 1STREF_SZ, VOBU 
playback time information VOBU_PB_TM, and VOBU size information VOBU_SZ. 

[0402] Note that reference picture size information 1STREF_SZ represents the size of the first reference picture 
(corresponding to l-picture in MPEG) of the VOBU of interest in units of sectors. 

[0403] The VOBU general information included in cell VOBU table #m in FIG. 8 includes l-picture end position infor- 
50 mation, as shown in FIG. 9. The presence of the l-picture end position information means the presence of the size infor- 
mation from the start position (address) of the VOBU of interest to that l-picture end position. Therefore, 1STREF_SZ 
(corresponding to the l-picture size of VOBU#1) in FIG. 31 corresponds to the VOBU general information included in 
the cell VOBU table in FIG. 8. 

[0404] VOBU playback time information VOBU_PB_TM in FIG. 31 represents the playback time of the VOBU of 
55 interest in units of video fields. 

[0405] The time code table included in cell time general information in FIG. 8 contains information of the number of 
pictures and the number of sectors in a VOBU. Since the playback time of each VOBU changes depending on the 
number of pictures and the number of sectors included there, the time code table included in cell time general informa- 
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tion #m in FIG. 8 includes information corresponding to VOBU playback time information VOBU_PB_TI\/l in FIG. 31. 
[0406] VOBU size information VOBU_SZ in FIG. 31 represents the size of tine VOBU of interest in units of sectors. 
Since one EGG block corresponds to 16 sectors, this VOBU size information VOBU_SZ corresponds to information of 
the number of EGG blocks in a VOBU included in the time code table shown in FIG. 8. 
5 [0407] VOBU playback time information VOBU_PB_TM in FIG. 31 represents the playback time of each VOBU of 
interest in units of video fields. In general, since one frame = one picture = two fields, information VOBU_PB_TM in FIG. 
31 indicates the same information contents as the number of VOBU pictures in FIG. 8. 

[0408] In summary, cell time information GTI shown in FIG. 8 (and FIG. 4) includes VOBU general information cor- 
responding to 1STREF_SZ in the VOBU entry in FIG. 31 , and cell time general information (the number of VOBU pic- 

10 tures and the number of EGG blocks in a VOBU) corresponding to VOBU_PB_TM and VOBU_SZ in FIG. 31 . 

[0409] Therefore, cell time information GTl#m shown in FIG. 8 (and FIG. 4) conceptually has contents correspond- 
ing to the VOBU entry in FIG. 31 . 

[0410] Gell time information GTI#1 in FIG. 31 is included in cell time control information GTGI, which is included in 
AV data control information DA210 in FIG. 4. 
15 [041 1] That is, navigation data (RTR_VIVIG) in FIG. 30 corresponds to control information DA21 in FIG. 4 in a broad 
sense. 

[0412] Normally, the "time interval between neighboring VOBUs" is expressed by the number of fields in the VOBU 
entry. As another method, a "count value from a given VOBU to the next VOBU by a clock counter" may be used to 
express the "time interval between neighboring VOBUs". 
20 [0413] For example, the "time interval between neighboring VOBUs" can be expressed by the "difference value 
between the value of presentation time stamp PTS (see FIG. 6) at the start position of one VOBU and the value of PTS 
at the start position of the immediately succeeding VOBU". 

[0414] In other words, "the time interval in a specific unit can be expressed by the difference value of the clock coun- 
ter in that unit". Such unit can also be called a streamer object unit (SOBU). 
25 [0415] In addition, the following remarks will be given in association with the contents of the time code table shown 
in FIG. 8. 

[0416] In FIG. 8, the time code table is expressed by the number of VOBU pictures, and the number of EGG blocks 
in a VOBU. As another embodiment of the present invention, the following method may be used. That is, the number of 
fields (one picture = two fields) included in a VOBU may be used in place of the number of VOBU pictures. Furthermore, 
30 the number of sectors (one EGG block = 1 6 sectors) in an area where the VOBU of interest is recorded can be used in 
place of the number of EGG blocks in a VOBU. 

[0417] FIG. 32 exemplifies the contents of time map general information TMAP_GI shown in FIG. 31 . 

[0418] This time map general information TMAP_GI includes TM_ENT_Ns indicating the number of time entries in 

that time map information, VOBU_ENT_Ns indicating the number of VOBU entries in that time map information, time 
35 offset TM_OSF for that time map information, and address offset ADR_OFS of that time map information. 

[0419] When a value (10 seconds or equivalent) corresponding to 600 fields in NTSG video (or 500 fields in PAL 

video) is used as time unit TMU, time offset TM_OSF is used to represent the time offset within TMU. 

[0420] When the VOBU size is expressed by the number of sectors, address offset ADR_OFS is used to indicate 

the total size of preceding VOBs (one or more preceding VOBs) in an AV file. 
40 [0421] FIG. 33 exemplifies the contents of time entry TM_ENT shown in FIG. 31 . 

[0422] This time entry TM_ENT includes VOBU_ENTN indicating the number of the corresponding VOBU entry, 

TM_DIFF indicating the time difference between the playback time of a VOBU designated by the time entry, and the 

computed playback time, and VOBU_ADR indicating the target VOBU address. 

[0423] When time unit TMU is expressed by 600 fields in NTSG (or when time unit TMU is expressed by 500 fields 
45 in PAL), the "computed playback time" with respect to time entry #j is given by TMU x (j - 1 ) + TM_OSF . 

[0424] On the other hand, VOBU_ADR indicates the target VOBU address by the total size of VOBUs preceding the 
VOBU of interest when the VOBU size is expressed in units of sectors. 

[0425] FIG. 34 is a view for explaining the recorded contents of data area DA in FIG. 30, and a time entry point 
(access point) upon playing back a specific portion (e.g., VOBU#3) of the recorded contents. 
50 [0426] As has been described above with reference to FIG. 30, data area DA records movie video object 
RTR_MOB.VOB (DA22a-1 to DA22a-3), still picture video object RTR_STO.VOB (DA23a-1), a computer data file, and 
the like. 

[0427] For example, in one movie video object RTR_MOB.VOB (DA22a-1 ), its data extent/set #A stores data (video 
pack V_PGK, sub-picture pack SP_PGK, and the like) from logical block numbers LBN • a to LBN • a+b-1 . 
55 [0428] These logical block numbers correspond to predetermined movie addresses (M • ADR o to M • ADR b-1 ). 
A given portion of a set of these movie addresses corresponds to VOBU#1 , and the remaining portion corresponds to 
VOBU#2. 

[0429] Likewise, a set of movie addresses corresponding to a portion of a computer data file (extent #B) in data 
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area DA corresponds to VOBU#3. On the other hand, a set of movie addresses corresponding to the remaining portion 
of the computer data file (extent #B) and movie addresses (M • ADRb+f ) of a portion of another extent #C corresponds 
to VOBU#4, and a set of movie addresses of the remaining portion of extent #C corresponds to VOBU#5. 
[0430] in the aforementioned set of VOBUs, VOBU#1 to VOBU#3 mai<e up video object VOB#a, and VOBU#4 to 
5 VOBU#5 make up video object VOB#p. 

[0431] In the data structure exemplified above, in order to start playback from the middle of, e.g., VOBU#3, that 
access point must be determined. This access point is assumed to be a time entry point. 

[0432] In the example shown in FIG. 34, the time entry point is located at a position separated the time difference 
indicated by time difference information TiVI_DIFF in time entry Ti\/I_ENT in FIG. 33 from a position indicated by movie 
10 address information (M - ADR b) of VOBU#3. This time entry point serves as a special playback start point (or time 

search point) indicated by time map information TMAPI. 

[0433] FIG. 35 is a view for explaining an example of the directory structure of information (data files) recorded on 
the optical disc shown in FIG. 1 in the structure shown in FIG. 30. 

[0434] Even when the data structure shown in FIG. 30 is used on the disc/apparatus side, this data structure is 
15 invisible to the user. The data structure that the user can actually see is a hierarchical file structure shown in FIG. 35. 
[0435] More specifically, directories such as a DVD_RTR directory, VIDEO_TS directory, AUDIO_TS directory, 
computer data file directories, and the like are displayed on the display screen (not shown) of the root directory by 
means of menu windows, icons, or the like in correspondence with the types of data recorded on data area DA shown 
in FIG. 30. 

20 [0436] The DVD_RTR directory shown in FIG. 35 stores file RTR.IFO of navigation data RTR_VMG in FIG. 30, 
backup file RTR.BUP of RTR.IFO, file RTR_MOV.VRO of movie video object RTR_MOV.VOB, file RTR_STO.VRO of 
still picture video object RTR_STO.VOB, file RTR_STA.VRO of additional audio object RTR_STA.VOB for still pictures, 
manufacturer specification object file MSRVOB, another stream object file ASTSOB, and the like. 
[0437] When the DVD video recorder shown in FIG. 27 (RTR video recorder capable of real-time recording) has a 

25 directory display function shown in FIG. 35, and a DVD video ROM disc is set in its disc drive 32, the VIDEO_TS direc- 
tory in FIG. 35 is activated. In this case, when the user opens the VIDEO_TS directory, the recorded contents of the set 
disc are further displayed. 

[0438] When the apparatus shown in FIG. 27 has a DVD audio playback function, and a DVD audio disc is set in its 
disc drive 32, the AUDIO_TS directory in FIG. 35 is activated. In this case, when the user opens the AUDIO_TS direc- 
30 tory, the recorded contents of the set desk are further displayed. 

[0439] Likewise, when the apparatus shown in FIG. 27 has a computer data processing function, and a DVD-RAM 
(or DVD-ROM) disc that recorded computer data is set in its disc drive 32, the computer data directory in FIG. 35 is acti- 
vated. In this case, when the user opens the computer data directory, the recorded contents of the set desk are further 
displayed. 

35 [0440] The user can access the recorded sources of DVD video, DVD video ROM, DVD audio, and computer data 
(including programs) as if he or she were operating a personal computer, while observing a menu screen or window 
display screen displayed with the directory structure shown in FIG. 35. 

[0441] FIG. 36 is a schematic view for explaining a case wherein the cell playbackorder of the initially recorded con- 
tents (original PGC) has been changed by the user later using a user-defined PGC. 

40 [0442] For example, video data (video object set VOBS) recorded on audio/video data area DA2 in FIG. 5 is com- 
prised of a set of one or more program chains PGC. Each PGC is a set of programs as sets of one or more cells, and 
the playback order of cells that form each program can be determined by original PGC information 
(ORG_PGCI • DA210d in FIG. 30) or a user-defined PGC information table (UD_PGCI • DA210e in FIG. 30). 
[0443] The playback time and order of cells designated by the original PGC information or user-defined PGC infor- 

45 mation are converted into the addresses of VOBUs that form each of cells to be played back via a table (TMAP) in time 
map information TMAPI in FIG. 30. 

[0444] More specifically, when playback is made based on an original PGC (the cell playback order in the initially 
recorded state), the VOBU addresses in the time band to be played back are obtained via time map information table 
TMAP in accordance with the contents of ORG_PGCI in FIG. 30, and playback is made in that order. 
50 [0445] On the other hand, when playback is made based on a PGC uniquely defined by the user (e.g., when the 
user has edited the playback order after video recording), the VOBU addresses in the time band to be played back are 
obtained via time map information table TMAP in accordance with the contents of UD_PGCI in FIG. 30, and playback 
is made in that order. 

[0446] The cell playback order based on user-defined PGC information UD_PGCI can be quite different from that 
55 based on original PGC information ORG_PGCI. 

[0447] Note that the playback time and the addresses of VOBUs to be played back can correspond to each other 
by looking up the contents of the time entries and VOBU entries in time map information TMAPI shown in FIG. 31 . 
[0448] The l-picture audio position information in FIG. 9 expresses, using sectors as a unit, the differential address 
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value from the start position of a VOBU of a sector tliat includes an audio pack of the same time as the l-picture start 
time. However, the present invention is not limited to such sector unit, and the differential address may be expressed 
using the number of differential ECC blocks or the number of VOBUs that indicates a "shift" amount, depending on dif- 
ferent embodiments of the present invention. 

[0449] That is, the "shift" amount of a VOBU that includes an audio pack of the same time as the l-picture start time 
from a VOBU that includes the l-picture of interest can be expressed using the number of VOBUs. FIG. 37 shows this 
example. 

[0450] FIG. 37 is a view for explaining problems that will occur in audio data when video recording is interrupted 
before a GOP of MPEG-encoded video data comes to an end upon recording corresponding audio data together with 

IVIPEG-encoded video data. 

[0451] In a DVD video recorder that makes video recording while executing MPEG encoding, the user (or video 
recording timer) sometimes interrupts video recording before a GOP (from a given l-picture to a position immediately 
before the next l-picture) comes to an end. In such case, audio data recorded parallel to video data is interrupted at the 
same time. 

[0452] Upon playing back MPEG-encoded video recorded contents, since an incomplete GOP portion cannot be 
decoded, a process for completing that GOP by appending correction data to the incomplete GOP is done upon encod- 
ing. 

[0453] In this case, since there is no audio data for a portion (less than 0.5 sec if the playback time per GOP is 0.5 
sec) corresponding to the playback time of the correction data appended to complete the GOP, sound Is interrupted 
(abnormal sound is produced in some cases) upon video playback of that portion. Assume that this portion is called an 
audio gap. 

[0454] In order to cope with sound interrupt (or abnormal sound) due to such audio gap upon playback, the position 

of this audio gap must be detected. 

[0455] The time at which the audio gap is displayed matches the time at which the last data of VOBU#n-1 is dis- 
played with respect to video data. Therefore, the last display time of the audio gap period (= the time at which the next 
audio information is displayed, i.e., the time at which sound restarts) matches the time at which the first l-picture in 
VOBU#n as the next VOBU is displayed. Therefore, the l-picture audio position in FIG. 9 is information indicating the 
position of a VOBU which includes an audio pack as the audio gap end time of the same time as the l-picture start time. 
[0456] More specifically, the position of specific information such as the audio gap can be detected by exploiting the 
contents of the audio synchronization information shown in FIG. 9. 

[0457] That is, the start position (l-picture start time) of the "GOP completed by correction" can be specified using 
"l-picture audio position" information in the audio synchronization information in FIG. 9. 

[0458] The presence of specific information such as the audio gap behind the l-picture in the GOP can be detected 
based on the most significant bit = "0" of 1-byte "l-picture audio position information". 

[0459] Also, the audio sample position that corresponds to the specific information like the audio gap from the l-pic- 
ture start time of the GOP can be specified by the "l-picture start audio sample number" in the audio synchronization 
information in FIG. 9. 

[0460] Use of the "l-picture start audio sample number" in FIG. 9 is not limited to position detection of the audio gap, 
but FIG. 37 exemplifies that the audio synchronization information shown in FIG. 9 can be exploited in "audio gap posi- 
tion detection". 

[0461] According to the embodiment of the present invention, the following effects are obtained. 

(1) When the time map information (TMAPI in FIG. 31) is available, even when the user has changed the playback 
order using user-defined PGC information to be different from an original one, the VOBU from which playback is to 
start can be detected using the VOBU entries in time map information TMAPI. By rewriting the user-defined PGC 
information without re-recording data while changing the initial recording order, video playback can be made in an 
arbitrary order. 

(2) Since the cell configuration of a program chain to be recorded can be corrected as needed in correspondence 
with the performance of a disc drive used, seamless, continuous playback or recording can be implemented irre- 
spective of the disc drive used. 

(3) Since the audio synchronization information is provided, even when postrecording is done from various sound 
sources (digital sound sources generated at various sample rates) using dummy packs and the like (i.e., even when 
the sample rate of an original sound source recorded in audio packs is different from that of another sound source 
recorded in dummy packs by postrecording), synchronization (playback timing) between a video signal recorded in 
video packs and the postrecorded audio signal can be prevented from shifting. 

(4) Since the audio synchronization information is provided, even when multi-channel recording is done using var- 
ious sound sources (digital sound sources generated at various sample rates), synchronization (playback timing) 
of audio signals among channels can be prevented from shifting. 
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(5) Even when the contents of a specific area in video information are re-arranged on the information storage 
medium by, e.g., an edit process, continuous audio signal playbacl< can be made without any sound interrupt or the 
like. 

Claims 

1. An information storage medium which records and plays back data including video data and control information, 
characterized in that the control information includes information that accesses a specific portion of the video data. 

2. An information recording system which uses an information storage medium that can record data including video 
data and control information, 

characterized in that information for accessing a specific portion of the video data is described in the control infor- 
mation recorded on the information storage medium. 

3. An information recording system which uses an information storage medium in which data including video data and 
control information has been recorded, 

characterized in that when information for accessing a specific portion of the video data is described in the control 
information recorded on the information storage medium, the access information is read out from the information 
storage medium. 

4. An information storage medium which can record a plurality of pieces of video information at discrete positions, 
characterized in that the plurality of pieces of video information are recorded to decrease an access frequency to 
the video information to be not more than a predetermined number of times upon sequentially playing back the plu- 
rality of pieces of video information. 

5. An information storage medium according to claim 4, characterized in that information cells are recorded in units of 
ECC blocks each having a predetermined size. 

6. An information playback system which plays back recorded information from an information storage medium which 
can record a plurality of pieces of video information at discrete positions, 

characterized in that when an access frequency to the video information exceeds a predetermined number of times 
upon sequentially playing back the plurality of pieces of video information, recording locations of the plurality of 
pieces of video information are changed so that the access frequency does not exceed the predetermined number 
of times. 

7. An information recording/playback system which plays back recorded information from an information storage 
medium that can record a plurality of information cells at discrete positions in units of cells, and also can record 
management information that describes a sequence of these information cells, 

characterized in that when an access frequency to the video information exceeds a predetermined number of times 
upon sequentially playing back the plurality of pieces of video information, a description of the management infor- 
mation is changed so that the access frequency does not exceed the predetermined number of times. 

8. An information storage medium which records and plays back audio/video data including video information, audio 
information, and control information, 

characterized in that the control information describes audio synchronization information for taking synchronization 
between the video information and the audio information. 

9. An information recording/playback system which records and plays back audio/video data including video informa- 
tion, audio information, and control information on and from a predetermined information storage medium, 
characterized in that audio synchronization information is described in the control information, and when specific 
information in the video information is re-recorded at a different position on the information storage medium, audio 
information that synchronizes the video information is re-recorded at a different position on the information storage 
medium in accordance with the audio synchronization information. 

10. An information storage medium which records and plays back audio/video data including video information, audio 
information, and control information, 

characterized in that the control information describes information that takes a positional correspondence between 
a portion of the video information and the audio information. 
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1. An information storage metliod wlnicii uses a medium that records and piays back audio/video data inciuding video 

information, audio information, and control information, 

cliaracterized in tinat information tinat takes a positional correspondence between a portion of the video information 
and the audio information is recorded as a portion of the control information. 

2. An information storage medium which records and plays back data including video data and control information, 
characterized in that said medium has a first data size for recording the video data, and a second data size defined 
by a plurality of first data sizes, and the video data is recorded in units of sizes each corresponding to one or more 
of the second data sizes. 

3. An information recording/playback system which records video data and control information on an information stor- 
age medium that has a first data size for recording the video data, and a second data size defined by a plurality of 
first data sizes, 

characterized in that the video data is recorded or edited in units of sizes each corresponding to one or more of the 
second data sizes. 
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